21 世纪 全 国 本 科 院 校 电气 信息 类 创新 型 应 用 人 才 培 养 规划 教材 


言 息 与 通信 专业 英语 


ež rž Utt 


7 PEKING UNIVERSITY PRESS 





内 容 简介 





AHR, 阅读 材料 和 摘要 阅读 组 成 , 内 容 分 别 选 自 各 领域 经 典 英文 教材 、 国 际 顶 级 期 刊 及 会 议 的 
论文 摘要 ， 紧 扣 信息 、 通 信 与 计算 三 大 主题 , 内 容 全 面 详实 。 

本 书 可 作为 信息 与 通信 工程 、 信 息 安全 、 电 子 科学 与 技术 、 电 子 信息 工程 、 计 算 机 科学 与 技术 和 网 络 
工程 等 专业 本 科 生 、 研 究 生 “专业 英语 ”课程 的 教材 ， 也 可 供 相关 专业 技术 人 员 学 习 和 参考 。 























图 书 在 版 编目 (CIP) 数 据 
信息 与 通信 专业 英语 / 刘 小 佳 主编 . 一 北京 : 北京 大 学 出 版 社 ， Xon 
(21 世纪 全 zz- AS 


























ISBN 978 - 7 - 301 - 25506 - 3 


I.@ 信 … TH. m. eU 一 教材 @@ 通 信 工 程 一 英语 一 高 等 





学 校 - 教 材 NV. DH31 


版 本 图 书馆 CIP 数据 核 字 
书 名 “信息 与 3 d rk 
著作 责任 者 X V 
责任 编辑 x 


bene As 78-7-301- asso A 


出 版 发 行 京 大 学 出 版 社 
址 北京 市 海淀 区 成 府 路 205 100871 
z 址 http://www. pup. cn 新 浪 微 博 : @ 北 京 大 学 出 版 社 
电子 信箱 pup_6@ 163. com 
"B i 邮购 部 62752015 ”发 行 部 62750672 ”编辑 部 62750667 
Ep 刷 者 
经 销 者 新 华 书店 
787 毫米 X1092 毫米 ”16 开本 13.75 印张 312 FF 
2015 年 4 月 第 1 版 2015 年 4 月 第 1 次 印刷 
定 价 29. 00 元 





H 




















未 经 许可 , 不 得 以 任何 方式 复制 或 抄袭 本 书 之 部 分 或 全 部 内 容 。 
版 权 所 有 ， 侵权 必 究 

举报 电话 : 010-62752024 ”电子 信箱 : fd@pup. pku. edu. cn 

图 书 如 有 印 装 质 量 问题 ,请 与 出 版 部 联系 , 电话: 010-62756370 


1} 


前 








多 年 来 ， SES Sie re MS Eee EAR: TIE 
专业 英语 教材 尽管 选择 了 非常 地 道 的 英文 原文 ， 但 其 内 容 止 于 “科普 ”， 未 能 达到 “ 专 
业 ”。 事 实 上 ， 文 史 哲 中 所 推崇 的 原 典 阅读 是 一 个 极 好 的 思路 ， 本 书 正 是 在 此 基础 上 展开 
了 有 益 的 尝试 。 

首先 ， 要 能 阅读 较为 专业 的 英语 文本 。 作 者 从 电子 信息 类 与 j 
教材 人 手 ， 精 选 若 干 原文 ， 模 拟 学 生 学 习 该 专业 课程 原文 教 
的 针对 性 。 显 然 ， 这 种 直接 阅读 大 师 原 典 的 做 法 不 但 能 提 
专业 方面 打开 知识 的 大 门 ， 为 进一步 阅读 经 典 教材 

其 次 ， 专 业 英 语 写作 也 是 很 重要 的 内 容 。 
































信 类 专业 领域 经 典 英文 
， 以 此 提高 专业 英语 
专业 英语 的 能 力 ， 还 能 在 
fili. 
领域 选 出 了 若干 国际 顶级 期 刊 与 





































会 议 的 论文 摘要 ， 针 对 国内 学 生 摘 要 写作 物 乱 加 进行 强化 训练 ， 以 此 aed 
文 写 作 的 能 力 ， 并 以 摘要 为 起 点 抛 砖 有 能力 的 学 生 进 行 完整 的 英语 科技 论 
写作 。 
sr AAD AE RISING LER, e 科学 与 技术 、 电 子 信息 工 
程 、 计 算 机 科 术 和 nd. 3f 算 三 大 主题 ， 突 出 
2d x LIME CAL REB Topo BE. IEIERUI A 
i ips rg) 从 而 更 好 地 提 综合 实力 。 


ne. - fit rh — B URGC LA S BSE A S f E DRE REAL, BEE 
Ns Alain 对 于 学 有 余力 的 学 生 ， 还 应 该 寻根 溯源 去 广泛 阅读 
对 应 的 国外 教材 ， 这 样 才能 达到 熟练 掌握 该 领域 英语 的 功效 。 为 了 帮助 读者 学 习 ， 作 者 对 
课文 中 的 词汇 给 出 了 在 该 领域 的 准确 释义 ， 并 精 选课 文中 的 阅读 难点 予以 注释 。 需 要 指出 
的 是 ， 读 者 应 仔细 研读 摘要 的 写法 ， 不 但 能 将 英文 翻译 为 中 文 ， 还 能 将 已 翻译 的 中 文 回 译 
成 英文 ， 多 年 的 教学 实践 证 明 ， 这 种 对 比 参照 的 双向 翻译 能 收 到 良好 的 效果 。 
感谢 西安 邮电 大 学 外 国语 学 院 袁 小 陆 教授 仔细 审阅 了 全 部 书稿 ， 并 提出 了 许多 中 肯 
建议 ! 感谢 西安 邮电 大 学 外 国语 学 院 诸多 同事 在 教材 编写 中 的 易 力 相助 ! 本 书 摘 选 诸多 
外 经 典 教材 的 英文 文本 ， 并 遂 选 多 篇 项 级 期 刊 会 议 的 论文 摘要 ， 在 此 一 并 对 原作 者 致谢 ! 
于 编者 学 识 所 限 ， 书 中 朴 漏 之 处 在 所 难免 ， 奶 请 读者 不 音 赐教 ! 
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UNIT 1 
INFORMATION THEORY 


Text: Introduction to Inform: heory 


Information theory answers two fundamé Session in communication theory: 


< 







What is the ultimate data compression (4 e entropy H), and what is the ultimate 


transmission rate of communication r: the channel capacity C). For this reason 


some consider information precy subset of commyifitation theory. We argue that it 
is much more. Indeed, it amental contributj © make in statistical physics 


(thermodynamics). co! 





ience (Kolmo, oris plexity or algorithmic complexi- 
ty). statistical inferénce K Occam's Razor; ,“ implest explanation is best"). and to 
probability an ees Cerror exponeni& timal hypothesis testing and estimation). 

This RN backward and forwakd through information theory and its naturally 
related ideas. Information theory intersects physics (statistical mechanics). mathematics 
(probability theory). electrical engineering (communication theory). and computer science 
(algorithmic complexity). We now describe the areas of intersection in greater detail. 

Electrical Engineering ( Communication Theory). In the early 1940s it was thought to 
be impossible to send information at a positive rate with negligible probability of error. 
Shannon surprised the communication theory community by proving that the probability of 
error could be made nearly zero for all communication rates below channel capacity. 

The capacity can be computed simply from the noise characteristics of the channel-", 
Shannon further argued that random processes such as music and speech have an irreducible 
complexity below which the signal cannot be compressed. This he named the entropy. in 
deference to the parallel use of this word in thermodynamics, and argued that if the entropy 
of the source is less than the capacity of the channel. asymptotically error-free communica- 


tion can be achieved. 
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Information theory today represents the extreme points of the set of all possible com- 


munication schemes. The data compression minimum I; lies at one extreme of the set 
of communication ideas. All data compression schemes require description rates at least e- 
qual to this minimum. At the other extreme is the data transmission maximum I(X;Y). 
known as the channel capacity. Thus. all modulation schemes and data compression 
schemes lie between these limits. 

Information theory also suggests means of achieving these ultimate limits of communi- 
cation, However. these theoretically optimal communication schemes. beautiful as they 
are, may turn out to be computationally impractical. It is only because of the computation- 
al feasibility of simple modulation and demodulation schemes that Ave use them rather than 

deu. 


the random coding and nearest-neighbor decoding rule sugge: annon's proof of the 






channel capacity theorem. Progress in integrated circuits à e design has enabled us to 


reap some of the gains suggested by Shannon's theo: putational practicality was fi- 
nally achieved by the advent of turbo codes. A ple of an application of the ideas 


of information theory is the use of error-cori spon des on compact discs and DVDs. 


f information theory has concentrated on 





many senders to many receiver; presence ea in ce and noise. Some of the 


Recent work on the communicati 
network information theory: the s the simultaneous rates of communication from 





trade-offs of rates ewe A ers and receive ected, and all have a certain 


mathematical simplicity, Pairs theory, h Xe to be found, 
Computer ^t imogorov Com] Mit olmogorov. Chaitin. and Solomonoff 
WP 


put forth they 





the complexity of Som of data can be defined by the length of the 
shortest MENS: mputer program for computing the string. Thus. the complexity is the 
minimal description length. This definition of complexity turns out to be universal. that is. 
computer independent, and is of fundamental importance ?', Thus. Kolmogorov complexi- 
ty lays the foundation for the theory of descriptive complexity. Gratifyingly. the Kolmog- 
orov complexity K is approximately equal to the Shannon entropy H if the sequence is 
drawn at random from a distribution that has entropy H. So the tie-in between information 
theory and Kolmogorov complexity is perfect. Indeed. we consider Kolmogorov complexity 
to be more fundamental than Shannon entropy. It is the ultimate data compression and 
leads to a logically consistent procedure for inference. 

There is a pleasing complementary relationship between algorithmic complexity and 
computational complexity. One can think about computational complexity (time complexi- 


ty) and Kolmogorov complexity (program length or descriptive complexity) as two axes 





corresponding to program running time and program length. Kolmogorov complexity focu- 


ses on minimizing along the second axis. and computational complexity focuses on minimi- 
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zing along the first axis. Little work has been done on the simultaneous minimization of the 
two. 

Physics( Thermodynamics). Statistical mechanics is the birthplace of entropy and the 
second law of thermodynamics. Entropy always increases. Among other things. the second 
law allows one to dismiss any claims to perpetual motion machines. 

Mathematics( Probability Theory and Statistics). The fundamental quantities of infor- 
mation theory—entropy, relative entropy, and mutual information—are defined as func- 
[3] 


tionals of probability distributions In turn. they characterize the behavior of long se- 


quences of random variables and allow us to estimate the probabilities of rare events (large 





deviation theory) and to find the best error exponent in pun tests. 


Philosophy of Science ( Occam's Razor). William of Occ; n GA “Causes shall not be 
multiplied beyond necessity,” or to paraphrase it, “The s st explanation is best, ”中 
Solomonoff and Chaitin argued persuasively that one universally good prediction pro- 


cedure if one takes a weighted combination of s that explain the data and ob- 





serves what they print next. Moreover. Ns will work in many problems not han- 






dled by statistics. For example, this pr: ASA eventually predict the subsequent digits 


of x. When this procedure is applied ips that come up heads with probability 0. 7, this 


li&d&o»he stock er should essentially 

arket and extrapo| optimally. In principle. such a 
procedure would have Pu ewton's laws of “om course, such inference is highly 
impractical, "n ing out all Mos rams that fail to generate existing data 


will take M VE ng. We would pré M happens tomorrow a hundred years from 


now. 


too will be inferred. When al 







find all the *laws" of the st 


Economics (Investment), Repeated investment in a stationary stock market results in 
an exponential growth of wealth. The growth rate of the wealth is a dual of the entropy 
rate of the stock market. The parallels between the theory of optimal investment in the 
stock market and information theory are striking. We develop the theory of investment to 
explore this duality. 

Computation vs. Communication, As we build larger computers out of smaller compo- 
nents, we encounter both a computation limit and a communication limit. Computation is 
communication limited and communication is computation limited. These become inter- 
twined, and thus all of the developments in communication theory via information theory 


should have a direct impact on the theory of computation. 
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New Words and Phrases 








transmission n. 传输 integrated circuit 集成 电路 

subset n. 子 集 reap vt. & vi. 收获 

thermodynamics n. 热力 学 turbo code turbo 编码 

algorithmic adj. 算法 的 simultaneous adj. 同时 的 ， 同 步 的 
statistical inference 统计 推断 trade-off n. 权衡 

exponent n. 指数 unifying vt. 使 统一 

optimal adj. 最 佳 的 ， 最 优 的 binary adj. 二 进 制 的 ， 二 元 的 
hypothesis n. 假设 tiecin n. 接头 1S 

intersect vi. 交叉 ， 相 交 M 缩 

complexity n. 复杂 度 compl tary adj. 互补 的 ， 补 充 的 


universal adj. 通用 的 , 广泛 的 , 普遍 的 ， axi 轴线 
宇宙 的 ` AA al motion 永恒 运动 
intersection n. 交集 K viation n. 偏差 ; 偏向 


mutual adj. 相互 的 ， 共 同 的 subsequent adj. 后 来 的 ， 随 后 的 


negligible adj. 微小 的 ; 可 忽 jx MAC EM . 推断， 推算 
i j. 最 佳 


irreducible adj. 不 能 分 解 的 2 UL 4 E 

entropy n. K 淘汰 ， SBR 
probability n. 概 3 nential n. 指数 

deference n. Xa n. XH 

WIE un 渐 近 地 adj. 指数 的 

modulation n. 调制 intertwine vi. & vt. 交织 ,纠缠 
feasibility n. 可 行 性 theorem n. 定理 

Notes 


1. The capacity can be computed simply from the noise characteristics of the channel. 
(信道 容量 只 用 信道 的 噪声 特征 即 可 计算 。 


2. Thus, the complexity is the minimal description length. This definition of complex- 




















ity turns out to be universal. that is. computer independent. and is of fundamental impor- 
tance. 

因此 ,复杂 度 是 最 小 描述 长 度 。 这 个 复杂 度 定义 是 通用 的 ,也 就 是 说 , 它 与 具体 计算 
机 无 关 并 且 有 着 根本 的 重要 性 。 
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3. The fundamental quantities of information theory—entropy. relative entropy. and 


mutual information—are defined as functionals of probability distributions. 

TESTE EAS tT. EX BOE 3 5 ET AY OTT PRU 

4. COccam's Razor) "Causes shall not be multiplied beyond necessity." or to para- 
phrase it, "The simplest explanation is best. " 

“如 无 必要 ， 勿 增 缘由 ”"， 也 就 是 “简单 的 解释 就 是 最 好 的 ”。 

Occam's Razor 奥 卡 姆 剃刀 原理 : 如 果 你 有 两 个 原理 ， 它 们 都 能 解释 观测 到 的 事实 ， 
那么 你 应 该 使 用 简单 的 那个 ， 直 到 发 现 更 多 的 证 据 。 对 于 现象 最 简单 的 解释 往往 比 复杂 的 
解释 更 正确 。 如 果 你 有 两 个 类 似 的 解决 方案 ， 选 择 最 简单 的 。 需 要 最 少 假设 的 解释 最 有 可 


能 是 正确 的 。 











Exercises 


I. Please translate the following ege into Chinese. 
. error-correcting “2 


probability theory = 
algorithmic complexity oS XP 
. large deviation the, 3 

. random proc zx XK 

. videi a es XM 


we 


. statis iL in iL inlerence 


go n gm og RI 


negligible P 


I. Fill in the blanks with the missing word(s) from the table below. 





























theorem. compression elements length 
capacity entropy random probability 
codes parallels transmission mutual 
defined block output exponent 
continuous duality redundancy constructing 
satisfy ratio capacity concept 
l. The relative D arises as the in the probability of error in a hy- 
pothesis test between two distributions. It is a natural measure of distance between distri- 


butions. 
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2. There are a number of between information theory and the theory of in- 
vestment in a stock market. A stock market is by a random vector X whose 
are nonnegative numbers equal to the of the price of a stock at the end of 


a day to the price at the beginning of the day. 
3. Entropy is the uncertainty of a single variable. We can define conditional 
entropy H(X | YO. which is the entropy of a random variable conditional on the knowledge 


of another random variable. 


4. We now define the (information) of the channel as the maximum of the 
information between the input and over all distributions on the input 
that the power constraint, 


5. We now introduce the concept of differential entropy. is the entropy of a 







random variable. Differential entropy is also re o the shortest description 


and is similar in many ways to the enone iscrete random variable. But 
there are some important differences. and t is! heed for some care in using the 


Rk existence of block codes that will allow 


us to transmit information at rates be with an arbitrarily small of 


error if the block length is large k X 

7. Although the hes Aid that there e with arbitrarily small 
probability of B for length: us 
best codes. x NA 


8. BN Re between isis of data and data transmission, 
T 


During comp + we remove all the in the data to form the most compressed 





6. The channel coding 





ot provide a way of the 





version possible. whereas during data + we add redundancy in a controlled fashion 


to combat errors in the channel. 
II. Translate the following paragraphs into Chinese. 


1. At first sight. information theory and gambling seem to be unrelated. But as we 
shall see, there is strong duality between the growth rate of investment in a horse race and 
the entropy rate of the horse race. Indeed, the sum of the growth rate and the entropy rate 
is a constant. In the process of proving this, we shall argue that the financial value of side 
information is equal to the mutual information between the horse race and the side informa- 
tion. The horse race is a special case of investment in the stock market. We also show how 
to use a pair of identical gamblers to compress a sequence of random variables by an amount 
equal to the growth rate of wealth on that sequence. Finally. we use these gambling tech- 


niques to estimate the entropy rate of English. 
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2. This enables us to divide the set of all sequences i sets, the typical set, 
where the sample entropy is close to the true entropy, a ontypical set. which con- 
tains the other sequences. Most of our attention wi M the typical sequences. Any 
property that is proved for the typical sequences. 3d true with high probability and 


will determine the average behavior of a RY 



































Reading: Channel Capacity 


What do we mean when we say that A communicates with B? We mean that the physi- 
cal acts of A have induced a desired physical state in B. This transfer of information is a 


physical process and therefore is 





subject to the uncontrollable ambient noise and imperfec- 
tions of the physical signaling process itself. The communication is successful if the receiv- 
er B and the transmitter A agree on what was sent. 

We find the maximum number of distinguishable signals for n uses of a communication 


channel. This number grows exponentially with n, and the exponent is known as the 
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channel capacity. The characterization of the channel capacity (the logarithm of the number 
of distinguishable signals) as the maximum mutual information is the central and most fa- 
mous success of information theory. 

The mathematical analog of a physical signaling system is shown in Figure 1. 1. Source 
symbols from some finite alphabet are mapped into some sequence of channel symbols. 
which then produces the output sequence of the channel. The output sequence is random 
but has a distribution that depends on the input sequence. From the output sequence. we 
attempt to recover the transmitted message. 

Each of the possible input sequences induces a probability distribution on the output 
sequences, Since two different input sequences may give rise to the same output sequence. 
the inputs are confusable. In the next few sections. we show AS. can choose a “non- 


confusable" subset of input sequences so that with high pr: there is only one highly 






likely input that could have caused the particular iy 
sequences at the output with a negligible probabili xe 
annel. 


the appropriate "widely spaced" input sequences 


then reconstruct the input 
. By mapping the source into 
we can transmit a message 
with very low probability of error and OBS e source message at the output. The 


maximum rate at which this can be dong i E the capacity of the channel. 







Message 


ZA. x Figure 1.1 ca System 


Definiti b ne a discrete ca (0 be a system consisting of an input alphabet 
X and output UD abet Y and a subs transition matrix p(y | 2) that expresses the 
probability of observing the output symbol y given that we send the symbol x. The channel 
is said to be memor yless if the probability distribution of the output depends only on the in- 
put at that time and is conditionally independent of previous channel inputs or outputs. 
Definition We define the "information" channel capacity of a discrete memoryless 
channel as where the maximum is taken over all possible input distributionspCr). 
C — maxICX;Y) Q-D 


pon 


We shall soon give an operational definition of channel capacity as the highest rate in 
bits per channel use at which information can be sent with arbitrarily low probability of er- 
ror. Shannon's second theorem establishes that the information channel capacity is equal to 
the operational channel capacity. Thus. we drop the word information in most discussions 
of channel capacity. 

There is a duality between the problems of data compression and data transmission. 


During compression. we remove all the redundancy in the data to form the most com- 
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pressed version possible. whereas during data transmission. we add redundancy in a con- 





trolled fashion to combat errors in the channel. Later we show that a general communica- 
tion system can be broken into two parts and that the problems of data compression and da- 


ta transmission can be considered separately. 


New Words and Phrases 


ambient adj. 背景 的 discrete adj. 离散， 离散 的 
imperfection n. 缺陷 arbitrarily adv. 任意 地 
exponentially adv. 成 指数 地 duality n. 对 偶 ， 二 元 性 
exponent n. FER. FE compression n. 

analog n. 模拟 wg Kats TUR BE 
negligible adj. 可 忽略 不 计 的 


Exercises AS 
I. Answer the following SS ~ 
1. What is the discrete nel? ee 


2. How to calcula 


3. What is d ancy? x 
1 axe 
I. Tails e following sentences ñt Chinese. 


l. This transfer of information is a physical process and therefore is subject to the un- 






channel capacity? 


controllable ambient noise and imperfections of the physical signaling process itself. 








2. Source symbols from some finite alphabet are mapped into some sequence of chan- 


nel symbols. which then produces the output sequence of the channel. 








3. By mapping the source into the appropriate "widely spaced" input sequences to the 
channel. we can transmit a message with very low probability of error and reconstruct the 


source message at the output. 
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4. During compression. we remove all the redundancy in the data to form the most 
compressed version possible, whereas during data transmission. we add redundancy in a 


controlled fashion to combat errors in the channel. 











Abstract Reading 


On the Capacity of the Two-user Gaussian Causal Cogni ct- Channel 


This paper considers the two-user Gaussian caus! itive interference channel (GC- 






CIC). which consists of two source-destinati rs Xhat share the same channel and 






where one full-duplex cognitive source can AN arn the message of the primary source 
through a noisy link. The GCCIC is an ¥ as nce channel with unilateral source coopera- 


tion that better models practical 1 radio networks than the commonly used model 
which assumes that one source SX noncausal k 

sage. First. the Mew s. symmetric GG 

gap. Then. the insigh ined from the UR symmetric GCCIC are extended to 


more general cases 











e of the other source's mes- 





is/determined to within a constant 






particular, the ácity region of the Gaussian Z-channel. 
i.e. , when thax is ho interference from primary user, and of the Gaussian S-channel, 
i.e., when th is no interference from the secondary user. are both characterized to 
within 2 bits. The fully connected general. i. e. . no-symmetric. GCCIC is also considered 
and its capacity region is characterized to within 2 bits when, roughly speaking. the inter- 
ference is not weak at both receivers. The parameter regimes where the GCCIC is equiva- 
lent, in terms of generalized degrees-of-freedom. to the noncooperative interference chan- 
nel (i. e. . unilateral causal cooperation is not useful), to the non-causal cognitive interfer- 
ence channel (i. e. , causal cooperation attains the ultimate limit of cognitive radio technolo- 
gy), and to bilateral source cooperation are identified. These comparisons shed light into 
the parameter regimes and network topologies that in practice might provide an unbounded 


throughput gain compared to currently available (non cognitive) technologies. 


Channel Coding and Lossy Source Coding Using a Generator of Constrained 


Random Numbers 


Stochastic encoders for channel coding and lossy source coding are introduced with a 
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rate close to the fundamental limits. where the only restriction is that the channel input al- 
phabet and the reproduction alphabet of the lossy source code are finite. Random numbers. 
which satisfy a condition specified by a function and its value. are used to construct sto- 
chastic encoders. The proof of the theorems is based on the hash property of an ensemble 
of functions, where the results are extended to general channels/sources and alternative 
formulas are introduced for channel capacity and the rate-distortion region. Since an en- 
semble of sparse matrices has a hash property. we can construct a code by using sparse ma- 


trices. 
Is Non-unique Decoding Necessary? 


In multiterminal communication systems, signals carryi ges meant for differ- 






ent destinations are often observed together at any give; ination receiver. Han and 


Kobayashi proposed a receiving strategy. which AS int unique decoding of messa- 
ges of interest along with a subset of messages. e not of interest, It is now well- 
known that this provides an achievable regio: is. in general. larger than if the re- 

By Nair and El Gamal and Chong, Motani, 


a generalization called indirect or nonunique 


^s ebook Wi ^ m to uniquely decode 
unique E een used in various scenarios. 








ceiver treats all messages not of interest/as 


Garg, and El Gamal independent] 
decoding where the receiver us 


" . = 
only its messages of interest? 









is to provide an etation and a systematic proof tech- 
nique for why no ue Hecoding. in all xo s 


dra 
replaced by a NS rly designed joint iue decoding strategy. without any penalty 
QN 


Írom a rate viewpoint, 


The main result in this Pape 


Ses where it has been employed. can be 





UNIT 2 


ALGORITHMS 


Text: Analyzing Algor 


X 


Analyzing an algorithm has come to mean i¢ting the resources that the algorithm 
: ; ~ a ; 
requires. Occasionally. resources such as ary. communication bandwidth, or comput- 
er hardware are of primary concern, st'often it is computational time that we want 


to measure. Generally. by analyzihg"several candidate alggyjthms for a problem. we can i- 







dentify a most efficient ones 2 nalysis may indi re than one viable candidate. 


but we can often discard sévera 7 m algorithm 





> process. 

Before we can-arfalyzd an algorithm. we,mi ve a model of the implementation tech- 
nology that EN illviise. including a moje the resources of that technology and their 
costs. We shal á ume a generic one procedsors random-access machine ( RAM)! model of 
computation as our implementation technology and understand that our algorithms will be 
implemented as computer programs. In the RAM model. instructions are executed one af- 
ter another. with no concurrent operations. 

Strictly speaking. we should precisely define the instructions of the RAM model and 
their costs. To do so. however. would be tedious and would yield little insight into algo- 
rithm design and analysis. Yet we must be careful not to abuse the RAM model. For ex- 
ample, what if a RAM had an instruction that sorts? Then we could sort in just one in- 
struction. Such a RAM would be unrealistic. since real computers do not have such in- 
structions, Our guide. therefore. is how real computers are designed. The RAM model 
contains instructions commonly found in real computers: arithmetic (such as add, sub- 
tract, multiply, divide, remainder, floor, ceiling), data movement (load, store, copy), 
and control (conditional and unconditional branch. subroutine call and return)”!, Each 


such instruction takes a constant amount of time. 
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The data types in the RAM model are integer and floating point (for storing real num- 
bers). Although we typically do not concern ourselves with precision in this book. in some 
applications precision is crucial. We also assume a limit on the size of each word of data. 
For example. when working with inputs of size n. we typically assume that integers are re- 
presented by c lg n bits for some constant c > 1. We require c Z 1 so that each word can 
hold the value of n. enabling us to index the individual input elements, and we restrict c to 
be a constant so that the word size does not grow arbitrarily. (If the word size could grow 
arbitrarily. we could store huge amounts of data in one word and operate on it all in con- 
stant time—clearly an unrealistic scenario. ) 


Real computers contain instructions not listed above. and such-instructions represent a 






gray area in the RAM model. For example. isexponentiatioi “fant time instruction? 


In the general case. no; it takes several instructions to c x when x and y are real 


numbers. In restricted situations. however. expone iS a constant-time operation. 


igh 
h shifting the bits of an integer by one 






Many computers have a “shift left” instruction. constant time shifts the bits of an 


integer by k positions to the left, In most Ew) 
position to the left is equivalent to mult? i 








ici by 2. so that shifting the bits by & posi- 


tions to the left is equivalent to multi ion by 2^. Therefore. such computers can com- 
pute 2k in one constant-time inst i y shifting eis by & positions to the left. 
Tg of bits in word. We will endeavor to a- 


as long as Ais no more than 





void such gray areas in model. but reat computation of 2* as a constant- 





time operation w is 
- 


In the EN 1, we do not atte model the memoryhierarchy that is common 


in contemporarj Computers, That is. we do not model caches or virtual memory. Several 


small enough pgsiti teger. 


computational models attempt to account for memory-hierarchy effects, which are some- 
times significant in real programs on real machines. A handful of problems in this book ex- 
amine memory-hierarchy effects. but for the most part. the analyses in this book will not 
consider them. Models that include the memory hierarchy are quite a bit more complex 
than the RAM model. and so they can be difficult to work with. Moreover. RAM-model 
analyses are usually excellent predictors of performance on actual machines. 

Analyzing even a simple algorithm in the RAM model can be a challenge. The mathe- 
matical tools required may includecombinatorics. probability theory. algebraic dexterity, 
and the ability to identify the most significant terms in a formula. Because the behavior 
of an algorithm may be different for each possible input. we need a means for summarizing 
that behavior in simple. easily understood formulas. 

Even though we typically select only one machine model to analyze a given algorithm. 


we still face many choices in deciding how to express our analysis. We would like a way 








that is simple to write and manipulate. shows the important characteristics of an 


algorithm's resource requirements. and suppresses tedious details. 
Analysis of Insertion Sort 


The time taken by the INSERTION-SORT procedure depends on the input: sorting a 
thousand numbers takes longer than sorting three numbers. Moreover. INSERTION- 
SORT can take different amounts of time to sort two input sequences of the same size de- 
pending on how nearly sorted they already are. In general. the time taken by an algorithm 


grows with the size of the input. so it is traditional to describe the running time of a pro- 


time" and "size of input" more carefully. 







gram as a function of the size of its input. To do so. we need to N the terms “running 

The best notion for input size depends on the proble: studied. For many prob- 
lems, such as sorting or computing discrete Fourier Ao s, the most natural measure 
is the number of items in the input—for examp! ray size n for sorting. For many 
other problems. such as multiplying two SR e best measure of input size is the to- 


tal number of bits needed to represent the,in n ordinary binary notation. Sometimes, it 


is more appropriate to describe the siz € input with two numbers rather than one. For 
instance, if the input to an algo is a graph. the qe can be described by the 
e which input size measure is 


numbers of vertices and edg Athe graph. We s i 
being used with each das we study. NE 
The running a Sd an algorithm on a jcülab-input is the number of primitive operations 
or "steps" ex Tis convenient to defi il die motion of step so that it is as machine-independ- 
ent as abe the moment, let us adopt the following view. A constant amount of time is 
required to execute each line of our pseudocode. One line may take a different amount of time than 
another line, but we shall assume that each execution of the ;^ line takes time c;, where c; is a 
constant, This viewpoint is in keeping with the RAM model. and it also reflects how the pseud- 


ocode would be implemented on most actual computers. 


New Words and Phrases 








bandwidth n. 带宽 concurrent adj. FRÆK; 同时 完成 的 ; 
inferior adj. 下 等 的 同时 存在 的 

n. 下级， 次 品 n。 共 点 ; 同时 发 生 的 事件 
implementation n. 实现 ， 懂 行 ， 安 装 启用 remainder n. Ars 
generic adj. 72 (bi) adj. FAHY 


execute vt， 执 行 。 运行 ceiling n， 向 上 取 整 











subroutine n， 子 程序 predictor n. HWE. WFF 





integer n. RC combinatoric adj. 组 合 学 的 
floating n， 浮 点 数 manipulate vt. 处理， 控制 
scenario n. insertion-sort 2。 插入 排序 
exponentiation n. RHE. HAE discrete adj. 离散 的 
equivalent adj. FASE AY. FAY AY. 等 效 的 ; binay adj. 
等 价 的 ， 等 积 的 n 二进制 数 

n， 对 等 物 vertices n. 顶点 
endeavor vt. & vi. 尝试 ; 尽力 primitive n. 元 数据 类 型 

n BA, BH preudocode n. Aft 
hierarchy n. 4. 层次 messy formul sius 
cache n， 高 速 缓 冲 存储 器 manipulate Sat 操作 
memory-hierarchy n， 存 储 层次 ; 分 级 存储 notati is. AS 


器 体系 


Notes 











RS 
GN 
SA 


ide 称 作 “随机 存储 器 "， 是 
CPU 直接 交换 数据 的 ffs. dump EAE. ter 时 读 写 ， 而 且 速 度 很 快 ， 通 常 作 
为 操作 系统 或 其 他 亚 在 运行 的 程序 的 临时 dta. 


2. The NN. el contains instrü 3 commonly found in real computers: arith- 
ás 


» subtract, multiply, divide. remainder, floor. ceiling), data movement 





l. random-access mac! 





metic (such 
(load. store. copy). and control (conditional and unconditional branch. subroutine call 
and return). RAM 模型 中 包含 的 指令 一 般 可 以 在 计算 机 中 找到 : 算术 (如 加 、 减 、 乘 、 
除 、 余 数 、 向 下 取 整 、 向 上 取 整 )， 数 据 传送 (如 加 载 、 存 储 、 复 制 ) 和 控制 (如 有 条 件 转移 
和 无 条 件 转移 、 子 程序 调用 和 返回 ) 。 


3. The mathematical tools required may includecombinatorics. probability theory. al- 





gebraic dexterity, and the ability to identify the most significant terms in a formula. 所 需 


的 数学 工具 可 能 包括 : 组 合 数学 、 概 率 论 、 代 数 技巧 及 识别 公式 中 有 效 项 的 能 力 。 








Exercises 


I. Translate the following words and phrases into Chinese. 


1. implementation technology 











concurrent operation 





unrealistic scenario 
positive integer 


virtual memory 


2. 

ES 

4. 

5. 

6. running time 
7. input size 

8. discrete Fourier transforms 
9. binary notation 

1 


0. primitive operation 


I. Fill in the blanks with the missing word(s) from the tab! low. 





































convenient implementation introduces 
distribution software reverse 

averaging notion 'enario alternatives 
random input infinitely insertion 
probabilities theoreti array efficient 







RJ 


1. If computers were — for solving a problem would 





ast, any MZ 


the bounds of good en- 


do. You would probabl an 
gineering practice ex =MR your M y should be well designed and docu- 





mented), ANS uld most often us ver method was the easiest to implement. 
2. E bes many algorithms + including the hiring problem. we use indica- 
tor des. Indicator random variables provide a method for conver- 
ting between and expectations. 
3. We must be very careful in deciding on the distribution of . For some 
problems. we may reasonably something about the set of all possible inputs. and 
then we can use probabilistic analysis as a technique for designing an algorithm 


and as a means for gaining insight into a problem. For other problems. we cannot describe 


a reasonable input + and in these cases we cannot use probabilistic analysis. 
4. This serves as a model for a common computational paradigm. We often 
need to find the maximum or value in a sequence by examining each element of 


the sequence and maintaining a current "winner". The hiring problem models how often we 
update our of which element is currently winning. 
5. Most of the algorithms we discuss have great practical utility. We therefore ad- 


dress implementation concerns and other engineering issues. We often provide practical 








to the few algorithms that are primarily of interest, 
6. In some cases. we assume that the inputs conform to a known probability distribu- 
tion. so that we are the running time over all possible inputs. 
7. Having specified the sort algorithm. we then argue that it correctly 


sorts, and we analyze its running time. The analysi a notation that focuses on 





how that time increases with the number of items to be sorted. 
8. In our analysis of insertion sort. we looked at both the best case. in which the input 


was already sorted, and the worst case. in which the input array was sorted, 


II. Translate the following paragraphs into Chinese. 


ning time for any input. Knowing it provides a guarantee e algorithm will never take 


l. The worst-case running time of an algorithm gives $ er bound on the run- 
t 


unning time and hope that it 





any longer. We need not make some educated guess aes 
never gets much worse. s4 




















2. For sone A ns, the worst ca fairly often. For example. in searching 
a database fi articular piece of infor. the searching algorithm’s worst case will 
often onc in the information is not present in the database. In some applications. 


searches for absent information may be frequent. 




















3. Often. we shall assume that all inputs of a given size are equally likely. In prac- 
tice, this assumption may be violated. but we can sometimes use a randomized algorithm, 
which makes random choices. to allow a probabilistic analysis and yield an expected run- 


ning time. 
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Reading: Multithreaded Ae” 


The vast majority of algorithms are serial algorit Ra for running on a uniprocessor 





computer in which only one instruction executes at this paper, we shall extend our al- 


that permits multiple instructions to execu ‘ently, In particular. we shall explore the ele- 






gorithmic model to encompass parallel RES can run on a multiprocessor computer 
ÓI 


gant model of dynamic multithreaded a 
nalysis, as well as to efficient impl tàtion in practice. 

Parallel inner Ee with multiple NT units—have become increas- 
ingly common, and they%pan a wide range of pri Ne. performance. Relatively inexpen- 
sive desktop and p chip multiprocess NUS a single multicore integrated-circuit 


chip that So le processing "V ach of which is a full-fledged processor that 
n 


hon memory. At an intermediate price/performance point are clusters 


as. which are amenable to algorithmic design and a- 








can access a ci 
built from individual computers—often simple PC-class machines— with a dedicated net- 
work interconnecting them. The highest-priced machines are supercomputers. which often 
use a combination of custom architectures and custom networks to deliver the highest per- 
formance in terms of instructions executed per second. Multiprocessor computers have 
been around. in one form or another. for decades. Although the computing community set- 
tled on the random-access machine model for serial computing early on in the history of 
computer science, no single model for parallel computing has gained as wide acceptance. A 
major reason is that vendors have not agreed on a single architectural model for parallel 
computers, For example. some parallel computers feature shared memory. where each 
processor can directly access any location of memory. Other parallel computers employ dis- 
tributed memory. where each processor's memory is private. and an explicit message must 
be sent between processors in order for one processor to access the memory of another. 


With the advent of multicore technology. however. every new laptop and desktop machine is now 
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a shared-memory parallel computer and the trend appears to be toward shared-memory multipro- 
cessing. Although time will tell. that is the approach we shall take in this paper. 

One common means of programming chip multiprocessors and other shared memory 
parallel computers is by using static threading. which provides a software abstraction of 
"virtual processors". or threads. sharing a common memory. Each thread maintains an as- 
sociated program counter and can execute code independently of the other threads. The op- 
erating system loads a thread onto a processor for execution and switches it out when an- 
other thread needs to run. Although the operating system allows programmers to create 
and destroy threads. these operations are comparatively slow. Thus. for most applica- 
tions. threads persist for the duration of a computation. which is why we call them "stat- 


ic". Unfortunately. programming a shared-memory parallel directly using static 








threads is difficult and error-prone. One reason is that dy: ly partitioning the work a- 
mong the threads so that each thread receives approxi 
a complicated undertaking. For any but the sim plications, the programmer must 
use complex communication protocols to v 1 scheduler to load-balance the work. 
This state of affairs has led toward the 


pato concurrency platforms. which provide a 






layer of software that coordinates üles. and manages the parallel-computing re- 





sources. Some concurrency platf € built as runtim ries. but others provide full- 
fledged parallel languages gone and runti 
Dynamic Multit di ming a 
S s of concurrency n is dynamic multithreading. which is the 


One imp 
model we Pet in this paper. Dynamic multithreading allows programmers to specify 
parallelism in applications without worrying about communication protocols. load balan- 
cing, and other vagaries of static-thread programming. The concurrency platform contains 
a scheduler. which load-balances the computation automatically, thereby greatly simplify- 
ing the programmer's chore. Although the functionality of dynamic-multithreading envi- 
ronments is still evolving. almost all support two features: nested parallelism and parallel 
loops. Nested parallelism allows a subroutine to be *spawned". allowing the caller to pro- 
ceed while the spawned subroutine is computing its result. A parallel loop is like an ordina- 
ry for loop. except that the iterations of the loop can execute concurrently. 

These two features form the basis of the model for dynamic multithreading that we 
shall study in this paper. A key aspect of this model is that the programmer needs to speci- 
Íy only the logical parallelism within a computation. and the threads within the underlying 
concurrency platform schedule and load-balance the computation among themselves. We 


shall investigate multithreaded algorithms written for this model. as well how the underly- 
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ing concurrency platform can schedule computations efficiently. 

Our model for dynamic multithreading offers several important advantages. 

(1) It is a simple extension of our serial programming model. We can describe a mul- 
tithreaded algorithm by adding to our pseudocode just three “concurrency” keywords: par- 
allel, spawn, and sync. Moreover, if we delete these concurrency keywords from the mul- 
tithreaded pseudocode, the resulting text is serial pseudocode for the same problem, which 
we call the “serialization” of the multithreaded algorithm. 

(2) It provides a theoretically clean way to quantify parallelism based on the notions 
of “work” and "span". 


the divide-and-conquer paradigm. Moreover, just as serial d-conquer algorithms 


(3) Many multithreaded algorithms involving nested Nu follow naturally from 







lend themselves to analysis by solving recurrences. so do tithreaded algorithms, 


(4) The model is faithful to how parallel-computin; tice is evolving. A growing number 
of concurrency platforms support one variant or a dynamic multithreading, including 


Cilk, Cilk++, Open MP. Task Parallel Ma Eds Building Blocks. 


New Words and Phrases WS 
multithreaded algorithms a "o 调换 
uniprocessor n. ir E ot adj. MT 

ition 2， 划分 ， 分 割 ， 隔离 物 


encompass vt. 
AS "TI Xe zt。 区 分 , 分 割 


parallel com f 行 计算 机 protocol». (数据 传递 的 ) 协 议 ; 科学 实验 报 
span vt. ， 测 量 告 (或 计划 ) 
sis n. gh compiler n. 编译 程序 
circuit chip 电路 芯片 dynamic multithreading 动态 多 线程 
full-fledged adj. 成 熟 的 ; 完全 的 ; 完备 的 ”vagary n. 奇想， 奇特 行为 
cluster n， 群 集 spawn vt. & vi. KEPE. 运行 子 程序 
vendor n， 经 销 商 subroutine n. FEF 
explicit adj. 明确 的 ， 清 楚 的 , 详 述 的 loop 7。 循环 
static acdjJ。 静 止 的 ， 静 电 的 pseudocode n， 伪 代码 
abstraction n， 抽 象 syne n， 同 步 
switch n. JPR; 转换。 转换 器 serialization n， 序 列 化 
ut. & vi. REAR, 改变， 转换， 关闭  divide-and-conquer paradigm 分 而 治之 范式 
电流 





ot. Fei, 迅速 转动 











Exercises 


I. Answer the following questions. 


1. What is parallel computer? 

2. Why programming a shared-memory parallel computer directly using static threads 
is difficult and error-prone? 

3. What does dynamic multithreading allow programmers to do? 


4. What important advantages does our model for dynamic multithreading offer? 


I. Translate the following sentences into Chinese. SS 
l. In particular. we shall explore the elegant $ ynamic multithreaded algo- 
rithms, which are amenable to algorithmic RN alysis. as well as to efficient im- 


plementation in practice. m x 

















2. Parallel Xie NW Mie processing units—have become in- 
n. 


creasingly NS Jahd they span a wid ane of prices and performance. 














3. One common means of programming chip multiprocessors and other shared memory 
parallel computers is by using static threading. which provides a software abstraction of 


“virtual processors”. or threads, sharing a common memory. 

















4. Dynamic multithreading allows programmers to specify parallelism in applications 
without worrying about communication protocols. load balancing. and other vagaries of 


static-thread programming. 
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Abstract Reading 


On the Complexity of Time-dependent Shortest Paths 


We investigate the complexity of shortest paths in tí endent graphs where the 


costs of edges (that is, edge travel times) vary as a iph of time, and as a result the 






shortest path between two nodess and d can e OVer time. Our main result is that 


when the edge cost functions are biet, ss. 


logo) 


ecewise linear. the shortest path from 






s to d can change n times, settli sexéral-year-old conjecture of Dean (Technical 


Reports. 1999, 2004). However ., des he fact that the arrival time function may have 
superpolynomial complexity, hat a minimum Dath for any departure time 
interval can be computed poe time. We hoW that the complexity is polyno- 
mial if the slopes of v" function come fr d class and describe an efficient 
scheme for comp 'a ( rege travel time function. 

atin Partition of Some Class bf Graphs under Compaction and 
Vertex-compaction 


The compaction problem is to partition the vertices of an input graph G onto the verti- 
ces of a fixed target graph H, such that adjacent vertices of G remain adjacent in H, and 
every vertex and non-loop edge of H is covered by some vertex and edge of G respectively. 
i.e. . the partition is a homomorphism of G onto H (except the loop edges). Various com- 
putational complexity results. including both NP-completeness and polynomial time solva- 
bility. have been presented earlier for this problem for various classes of target graphs H. 
In this paper. we pay attention to the input graphs G. and present polynomial time algo- 
rithms for the problem for some class of input graphs. keeping the target graph H general 
as any reflexive or irreflexive graph. Our algorithms also give insight as for which in- 
stances of the input graphs. the problem could possibly be NP-complete for certain target 
graphs. With the help of our results. we are able to further refine the structure of the in- 


put graph that would be necessary for the problem to be possibly NP-complete. when the 











target graph is a cycle. Thus. when the target graph is a cycle. we enhance the class of in- 
put graphs for which the problem is polynomial time solvable. We also present analogous 
results for a variation of the compaction problem. which we call the vertex-compaction 
problem. Using our results. we also provide important relationships between compaction, 


retraction, and vertex-compaction to cycles. 
Sublinear Algorithms for Approximating String Compressibility 


We raise the question of approximating the compressibility of a string with respect to a 
fixed compression scheme. in sublinear time. We study this question in detail for two pop- 
ular lossless compression schemes; run-length encoding (RLE) and. a variant of Lempel-Ziv 
(LZ77) . and present sublinear algorithms for approximatin, js with respect 
to both schemes. We also give several lower bounds that E our algorithms for both 
schemes cannot be improved significantly. 


Our investigation of LZ77 yields results whoSe\interest goes beyond the initial ques- 







tions we set out to study. In particular. we av binatorial structural lemmas that re- 
t 


late the compressibility of a string with’ xes| o LZ77 to the number of distinct short 


substrings contained in it (its /^ su mplexity, for small /). In addition, we show 


that approximating the compressi ith respect to I is related to approximating the 
support size of a diseased 


s 





UNIT > 


IMAGE COMPHESSION 


Text: Approaches to Image ression 


An image compression method is normally sd for a specific type of image, and 
this section lists various approaches to sgh g images of different types. Only the 
general principles are discussed here. 

Approach 1: This is appropria SS 
presented by one bit. Ampli principle of i pression to a bi-level image 
therefore means that they hte neighbors of sie tend to be identical to P. Thus. 
ngth encoding-^ CR compre; 


: 一 : 
sion method sr image may scan) Ardster order (row by row) and compute the 


l : "nm doe 2 A 
bi-level images! pixel in such an image is re- 





it makes sense to usetru such an image. A compres- 





lengths of runs lack and white pixels. the lengths are encoded by variable-length codes 
and are written on the compressed stream. It should be stressed that this is just an ap- 
proach to bi-level image compression. The details of specific methods vary. For instance. a 
method may scan the image column by column or in zigzag. it may convert the image to a 
quad tree. or it may scan it region by region using a space-filling curve”. 

Approach 2; Also for bi-level images. The principle of image compression tells us that 
the neighbors of a pixel tend to be similar to the pixel. We can extend this principle and 
conclude that if the current pixel has color ¢ (where c is either black or white). then pixels 
of the same color seen in the past (and also those that will be found in the future) tend to 
have the same immediate neighbors. 

This approach looks at n of the near neighbors of the current pixel and considers them 
an n-bit number. This number is the context of the pixel. In principle there can be 2" con- 
texts. but because of image redundancy we expect them to be distributed in a nonuniform 


way. Some contexts should be common while others will be rare. 
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The encoder counts how many times each context has already been found for a pixel of 





color c. and assigns probabilities to the contexts accordingly. If the current pixel has color 
c and its context has probability p, the encoder can use adaptive arithmetic coding to en- 
code the pixel with that probability. This approach is used by JBIG-". 

Next. we turn to grayscale images. A pixel in such an image is represented by n bits 
and can have one of 2" values. Applying the principle of image compression to a grayscale 
image implies that the immediate neighbors of a pixel P tend to be similar to P, but are not 
necessarily identical. Thus. RLE should not be used to compress such an image. Instead. 
two approaches are discussed. 

Approach 3; Separate the grayscale image into n bi-level images and compress each 


with RLE and prefix codes. The principle of image compress} s to imply intuitively 







that two adjacent pixels that are similar in the grayscale i: will be identical in most of 


the n bi-level images. This. however. is not true. as ollowing example makes clear. I- 







magine a grayscale image with n=4 (i. e. , 4-bi Sr 16 shades of gray). The image 





can be separated into four bi-level images. I&t 
~ 


S 


cent pixels with values 0111 and 1000 are also 


jacent pixels in the original grayscale 


image have values 0000 and 0001, then Similar. They are also identical in three of 






the four bi-level images. However. t 


similar in the grayscale image (t es are 7 and 8, &pectively) but differ in all four 


bi-level images. > 

This problem occu IK the binary c incen integers may differ by sever- 
al bits, The — and 1 differ.by t. those of 1 and 2 differ by two bits. 
and those of 7«and T üffer by four bits. solution is to design special binary codes such 
that the codes y consecutive integers and i+1 will differ by one bit only. 


Approach 4; Use the context of a pixel to predict its value. The context of a pixel is 
the values of some of its neighbors. We can examine some neighbors of a pixel P. compute 
an average A of their values. and predict that P will have the value A. The principle of im- 
age compression tells us that our prediction will be correct in most cases. almost correct in 
many cases, and completely wrong in a few cases. We can say that the predicted value of 
pixel P represents the redundant information in P. We now calculate the difference 

A B P—A (3-1) 

and assign variable-length codes to the different values of A such that small values 

(which we expect to be common) are assigned short codes and large values (which are ex- 

pected to be rare) are assigned long codes. IfP can have the values 0 through m—1, then 

values of A are in the range [—(@m—1). +-Gn—1)]. and the number of codes needed is 2 
(m— 1) 十 1 or 2m— 1. 


Experiments with a large number of images suggest that the values ofA tend to be dis- 
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tributed according to the Laplace distribution". A compression method can. therefore. 
use this distribution to assign a probability to each value of A. and use arithmetic coding to 
encode the A values very efficiently. This is the principle of the MLP method. 

The context of a pixel may consist of just one or two of its immediate neighbors. 
However. better results may be obtained when several neighbor pixels are included in the 
context. The average A in such a case should be weighted, with near neighbors assigned 
higher weights. Another important consideration is the decoder. In order for it to decode 
the image. it should be able to compute the context of every pixel. This means that the 
context should employ only pixels that have already been encoded. If the image is scanned 
in raster order. the context should include only pixels located above the current pixel or on 


the same row and to its left. 


Approach 5; Transform the values of the pixels and e the transformed values. 
The concept of a transform. as well as the most im t transforms used in image com- 
pression, is devoted to the wavelet transform. | that compression is achieved by re- 


ducing or removing redundancy. The We sv an image is caused by the correlation 
tha 


between pixels. so transforming the pi epresentation where they are decorrelated 







eliminates the redundancy. It is als € to think of a transform in terms of the entro- 


py of the image. In a highly corr mage. the Pines have equiprobable values, 
which results in maximum ef y. If the transfor oy 5 are decorrelated, certain pix- 
el values become P^ id wn having lar; Ss ilities. while others are rare. This 
results in small e y. uantizing the t nv values can produce efficient lossy im- 
age compressi ant the transforrr ues to be independent because coding inde- 
pendent ru it simpler to construct a statistical model. We now turn to color ima- 
ges. A pixel in such an image consists of three color components, such as red. green, and 
blue. Most color images are either continuous-tone or discrete-tone. 

Approach 6: The principle of this approach is to separate a continuous-tone color im- 
age into three grayscale images and compress each of the three separately. using approa- 
ches 3. 4, or 5. For a continuous-tone image. the principle of image compression implies 
that adjacent pixels have similar. although perhaps not identical. colors. However. similar 
colors do not mean similar pixel values. Consider. for example. 12-bit pixel values where 
each color component is expressed in four bits. Thus. the 12 bits 1000 | 0100 | 0000 repre- 
sent a pixel whose color is a mixture of eight units of red (about 50%, since the maximum 
is 15 units), four units of green (about 2576). and no blue. Now imagine two adjacent 
pixels with values 0011 | 0101 | 0011 and 0010 | 0101 | 0011. They have similar colors. 
since only their red components differ. and only by one unit. However. when considered as 
12-bit numbers. the two numbers 001101010011 and 001001010011 are very different. 








| UNIT 3 IMAGE COMPRESSION 4 


since they differ in one of their most significant bits. 

An important feature of this approach is to use a luminance chrominance color repre- 
sentation instead of the more common RGB'?, The advantage of the luminance chromi- 
nance color representation is that the eye is sensitive to small changes in luminance but not 
in chrominance. This allows the loss of considerable data in the chrominance components, 


while making it possible to decode the image without a significant visible loss of quality. 


New Words and Phrases 






pixel n. ($3 adjacent adj.  4H4[ 
encode vt. 编码 decoder n. i9 
stream n， 流 ， 连 续 传 输 的 信息 序列 correlation ， 相 关 性 


zigzag n， 锯 齿 形 线条 elimin; 消除 ， 淘 汰 
quad tree 四 又 树 e e acj， 等 概率 的 
redundancy n. JUR. IRE .owauimize vt， 量 化， 数值 化 
nonuniform adj. 不 均匀 的 , 不一致 的 "Ry adj. 有 损 的 ， 致 损耗 的 
encoder n， 编 码 器 NS luminance n. 亮度, 发 光度 


grayscale n. IKE X chrominai x mE 3 


intuitively adv. HH, 


Notes Z^ 
f we 

l. bi-leveliMages: tH binary images. — fti AUR gii. 

2. rundength encoding; RLE， 游 程 编码 。 原 理 是 将 一 扫描 行 中 的 颜色 值 相同 的 相 邻 
像素 用 一 个 计数 值 和 那些 像素 的 颜色 值 来 代 蔡 。 

3. space-filling curve; 空间 填充 曲线 。 

4. Laplace distribution: 拉 普 拉 斯 分 布 ， 如 果 随 机 变量 的 概率 密度 函数 分 布 为 
ex((- ^5) if icy 


< [ 
—lz—al j=} (3-2) 


ar sd. 
f |n = z| 5 





那么 它 就 是 拉 普 拉 斯 分 布 。 其 中 , p EMERSA, 00 是 尺度 参数 。 如 果 yy 二 0， 那 
人 么 ， 正 半 部 分 恰好 是 尺度 为 1/2 的 指数 分 布 。 

5. JBIG; Joint Bi-level Image Experts Group， 联 合 二 值 图 像 专 家 组 ， 是 发 布 二 值 图 
像 编 码 标准 的 专家 组 。 

6. RGB: RGB 色彩 模式 是 工业 界 的 一 种 颜色 标准 ， 是 通过 对 红 (R)、 绿 (G)、 蓝 (B) 












J 


ZAREE EIR E fI HERE 2 [8] BU e E 6386 5 SE HY BA, RGB 即 是 代表 
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红 、 绿 、 蓝 三 个 通道 的 颜色 ， 这 个 标准 几乎 包括 了 人 类 视力 所 能 感知 的 所 有 颜色 ， 是 目前 
运用 最 广 的 颜色 系统 之 一 。 
Exercises 


I. Please translate the following words and phrases into Chinese. 
l. grayscale image 

2. variable-length codes 

3. image compression Xs 
4. adaptive arithmetic coding SS 

5. wavelet transform xv 

6. pixel values WV 

7. color components A x 

8. redundant information K 

9. adjacent pixel NS 

10. prefix codes EI S XP 

I. Fill in the venir missing word(s, ne table below. 


























pixel I grayscale - storage glitches 
stream transmitted identify internal 
redundant constructed corrupted immune 
components. compress image misunderstanding 
hardware device fraction circuits 
1. A possible way to such an image is to scan it. regions, and find 
repeating regions. If a region B is identical to an already found region A. then B can be 


compressed by writing a pointer to A on the compressed 

2. An image compression method that has been developed specifically for a certain 
type of image can sometimes be used for other types. Any method for compressing bi-level 
images. for example. can be used to compress images by separating the bitplanes 
and compressing each individually. as if it were a bi-level 

3. Color images provide another example of using the same compression method 


across image types. Any compression method for grayscale images can be used to compress 
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color images. In a color image. each is represented by three color 
as RGD). 

4. Every time information is + over any channel. it may get by 
noise. In fact. even when information is stored in a device. it may become bad. 
because no piece of is absolutely reliable. 

5. Our language is because only a very small of all possible words are 
valid. A huge number of words can be with the 26 letters of the English alphabet. 

6. Errors are a fact of life. They are all around us. are found everywhere. and are re- 
sponsible for many and accidents and for much misery and . Unfortu- 
nately, computer data is not an exception to this rule and is not to errors, Digital 


data written on a storage such as a disk. CD. or bject to corruption. 





Similarly. data stored in the computer's BR bécome bad because of a 
i 


sudden surge in the electrical voltage. a stray cosmi thing the memory + or 


an extreme variation of temperature. 
II. Translate the following paragraphs>i inese. 


1. Developers and implementers Sy image compression methods need a standard 


metric to measure the quality of r structed images M on with the original ones. The 
better a reconstructed Wr nbles the original t igger should be the value pro- 
duced by this metric. Sy a, metric should meas a dimensionless number. and that 


ve sensitive to sey 
Re 






number should no} ions in the reconstructed image. 


Sy 





























2. The advantage of the luminance chrominance color representation is that the eye is 
sensitive to small changes in luminance but not in chrominance. This allows the loss of 
considerable data in the chrominance components, while making it possible to decode the 


image without a significant visible loss of quality. 
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3. A discrete-tone image. also called a graphical image or a synthetic image. is nor- 
mally an artificial image. It may have a few colors or many colors. but it does not have the 
noise and blurring of a natural image. Examples are an artificial object or machine. a page 


of text. a chart, a cartoon, or the contents of a computer screen. 

















4. A bi-level Cor monochromatic) image is an image the pixels can have one of 


two values, normally referred to as black and white. ixel in such an image is repre- 


sented by one bit. making this the simplest SN 
oa 


I 




















Images are all around us. We see them in color and in high resolution. Many objects 
(especially artificial objects) seem perfectly smooth. with no jagged edges and no graini- 
ness. Computer graphics. on the other hand. deals with images that consist of small dots, 
pixels. The term pixel stands for “picture element”. When we first hear of this feature of 
computer graphics. we tend to dismiss the entire field as trivial. It seems intuitively obvi- 
ous that an image that consists of dots would always look pixelated, grainy, rough, and in- 
ferior to what we see with our eyes. Yet state-of-the-art computer-generated images are of- 
ten difficult or impossible to distinguish from their real counterparts, even though they are 
discrete, made of pixels, and not continuous. 

Most engineers. programmers. and users think of pixels as small squares. and this is 
generally true for pixels on computer monitors. Pixels in other digital output devices (dis- 
plays or printers) may be rectangular or circular. However. in principle. a pixel should be 


considered a mathematical. dimensionless. point. It seems impossible to reconstruct a con- 
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tinuous image from an array of discrete pixels. but this is precisely what the surprising 
Nyquist-Shannon sampling theorem tells us (in fact. what it guarantees). Here we apply it 
to two-dimensional images. 

Audio is a good starting point to understand the sampling theorem. Sound fed into a 
microphone is converted to an electrical voltage that varies with time; it becomes a wave. 


A wave has a frequency, and a wave that varies all the time consis 





s of many frequencies. 
We denote the maximum frequency contained in a wave by B (cycles per second. or 
Hertz). The sampling theorem says that it is possible to reconstruct the original wave if it 
is sampled at a rate greater than 2B samples per second. 

An image is a rectilinear array of point samples (pixels). The,sampling theorem guar- 


antees that we will be able to reconstruct the image (i. e. « to te the color of every 







mathematical point in the image) if we sample the image a greater than 2B pixels per 
unit length. where B is the maximum pixel frequen the image. In practice. pixels. 


their values, and their frequencies depend on thi råðy of the capturing device. An ideal 
device should measure the color of an image a e points. but image sensors (CCDs and 
CMOS) used in real devices (cameras a EAR are often far from ideal. 

Because of physical limitations acturing defects. and the need to capture e- 
nough light, an image sensor oft easures the averag Y r.Cor intensity) of a small are- 
a of the image, instead of t dolor at a point. 


Assuming that peers pixels for cae image. we compute the color 
of a given point ein ge by interpolatiga. We we only touch on the principles of bi- 
linear and bi c interpolations. The di Sion assumes a grayscale image. where each 
pixel is a PN indicating a shade of gray (an intensity). but interpolation can easily be 
extended to color images. where a pixel is a triplet of primary colors. 

Bilinear interpolation, Given enough pixels of an image and given a point Q in the im- 
age. we use bilinear interpolation to compute the intensity (grayscale) of the image at Q in 
the following steps. 

1. We select the four pixels surrounding Q, denote their values by a. b, c. and d 
(Figure 3. 1) and convert them to three-dimensional points by considering the value of a 
pixel the z coordinate of the point and adding appropriate x and y coordinates. We end up 
with the points Po =(0, 0, a), Pa =(0, 1, b), Pu=(, 1, c), and Po=(1, 0, d). 

2. We compute the parametric equations of the straight segment Lı (u) running from 
Pto Pi and the segment L; (wu) running from Pu to Pi. These equations are the simple 
linear interpolations L; (u) — P» I —:) - Piou and L; G0 — P O—:i)0-- Piu. 

3. We compute the parametric equation Pu. w of the bilinear surface generated by in- 


terpolating segments L; (u) and L;(w). The equation is 








" i] 


PG. W=L G0(G—9w) 4L; (W w= P du) Aw) + Py ude) +P Aww + Py ww 














Figure 3.1 A Bilinear Surface. SS 
Figure 3. 1 is an example of such a surface. The surface is spanned when param- 


eters u and w are varied independently in the in 1 1]. 
4. We determine the values of u and w S. ven point Q and compute its intensity 
- 

iks 


as the z coordinate of surface P(u. w) values. 






Bicubic interpolation. The princip: cubic interpolation is similar to that of bilinear 


RUN nage. we esie of 4X4 pixels. convert 
: and compute t Mál surface P(u, w) that passes 
through these points. Once this surface is kno b^ values of parameters u and w for Q 
are determined MS infensity of image ki eom the height of the surface at the 


ese values. RX 


K 


interpolation. Given a point 






: : ~ 
them to three-dimensional 





point determi, 





New Words and Phrases 





resolution n. 分辨 率 discrete adj. 离散 的 
artificial adj. 人工 的 ， 人 造 的 rectangular adj. 矩形 的 
jagged ages 锯齿 边缘 dimensionless adj. 无 维 数 的 
graininess n， 粒 状 voltage n， 电 压 
trivial adj. 平凡 的 denote vut. WE. HAR. 代表 
intuitively adv。， 直 觉 地 ， 直 观 地 two-dimensional n. 二 维 的 , 平面 的 
pixelate vi. 像素 化 bicubic n， 双 三 次 的 ， 两 次 立方 的 

n。， 滤 镜 interpolation n， 插 值 
grainy adj. 粒状 的 triplet n。 三 元 组 
state-of-the-art 技术 发 展 水 平 bilinear interpolation 双 线 性 插值 (内 插 ) 


counterpart n. MÆ. HXH parametric equation 参数 方程 











segment n. 4} 


Exercises 


I. Answer the following questions. 


1. What is pixel? 
2. What does sampling theorem guarantee? 
3. What is bilinearinterpolation? 


4. What is bicubicinterpolation? 


]. Translate the following sentences into Chinese. SS 
1. Most engineers. programmers, and users d ls as small squares. and this 
i 


is generally true for pixels on computer monit s in other digital output devices 
(displays or printers) may be rectangular aw However, in principle, a pixel should 
1 


int, 


ur 


be considered a mathematical, dimensio; 

























sical limitations, acturing defects. and the need to capture e- 


x 


2. Beca NS. 
nough NEN age sensor often measures the average color Cor intensity) of a small are- 


a of the image, instead of the color at a point. 

















3. Audio is a good starting point to understand the sampling theorem. Sound fed into 
a microphone is converted to an electrical voltage that varies with time; it becomes a wave. 
A wave has a frequency. and a wave that varies all the time consists of many frequencies. 
We denote the maximum frequency contained in a wave by B (cycles per second. or 


Hertz). 
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Model Correction for Cross-channel Chroma Prediction 


A new inter-channel coding mode named LM mode has been intensively explored in the 






HEVC standardization project. This mode predicts the chroma from the luma signal 


using a linear model whose parameters are inferred from oring reconstructed luma 
and chroma samples. Although this mode presents xe coding efficiency. it is ob- 
served that ill cases in the linear parameters calc. Ne n 
per gives an overview of the LM mode MEL Sid novel model correction scheme to de- 


be detected and fixed. This pa- 


tect and correct those ill cases. Simulati s show significant bit-rate savings by the 


proposed correction scheme with limi 







:d complexity. 
Progressive-to-lossless Coi 
S 


of Color si ane Using Macropixel spec- 
VE X 


We present ae nplexity eta spectral-spatial transform that allows 
for efficient loss Am ossy compressi Pw or-filter-array images (also referred to as 
camera-raw os | The main advantage $ 


array values into a format that can be directly compressed in a loss less. lossy. or progres- 


tral-spatial Transformation 







this new transform is that it maps the pixel 


sive-to-loss less manner by an existing typical image coder such as JPEG 2000 or JPEG XR. 
Thus. no special coded design is needed for compressing the camera-raw data. Another ad- 
vantage is that the new transform allows for mild compression of camera-raw data in a near- 
loss less format. allowing for very high quality offline post-processing. but with camera- 


raw files that can be half the size of those of existing camera-raw formats. 
Efficient Data Packet Compression for Cache Coherent Multiprocessor Systems 


Multiprocessor systems have been popular for their high performance not only for 
server markets but also for computing environments for general users. With the inreased 
software complexity. networking overheads in multiprocessor systems are becoming one of 
the most influential factors in overall system performance. In this paper. we attempt to re- 
duce communication overheads through a data packet compression technique integrating a 


cache coherence protocol. Here we propose Variable Size Compression (VSC) scheme that 
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compresses or completely eliminates data packets while harmonizing with existing cache coherence 





protocols. Simulation results show approximately 23% of improvement on average in terms of o- 
verall system performance when compared with the most recent compression scheme. VSC also 


improves performance by 20% on average in terms of cache miss latency. 
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ARTIFICIAL INTELLIGENCE 


Text: Properties of Task E 


The range of task environments that might Alis obviously 








ever, identify a fairly small number of 


categorized. These dimensions deter 


tion, First, we list the mi oid 
the ideas (Table 4. 1). d itions here are in 


ments 


vast. We can, how- 


along which task environments can be 


large extent. the appropriate agent design 


indipal families of techniques for agent implementa- 
en we analyze & | task environments to illustrate 


Their PEAS Descriptions 





























Agent Type crídrfhance Measure| Envi jent Actuators Sensors 
7 " n y . | Display of questions. | Keyboard entry of 
Medical diagno- | Healthy patient. re- | Patient, hospi- 3 "s 
tests. diagnoses. | symptoms. findings. 
system duced costs tal, staff 
treatments, referrals | patient's answers 
Satellite image | Correct image cate- | Downlink from | Display of scene cate- | |, é 
His > Bn k Hee Color pixel arrays 
analysis system | gorization orbiting satellite | gorization 
Part-picking ro- | Percentage of parts | Conveyor belt , Camera, joint 
; z Jointed arm and hand 
bot in correct bins with parts; bins angle sensors 
T á A " Temperature. pres- 
Refinery con- | Purity, yield, safe- | Refinery, opera- | Valves. pumps. . 
A sure, chemical sen- 
troller ty tors heaters, displays 
sors 
, " B Display of exercises. 
Interactive Eng- | Student's score | Set of students. Mel 
è i suggestions, correc- | Keyboard entry 
lish tutor on test testing agency i 
tions 














Fully observable vs. partially observable; If an agent's sensors give 


plete state of the environment at each point in time, then we say that t 


it access to the com- 





he task environment 








is fully observable. A task environment is effectively fully observable if the sensors detect 
all aspects that are relevant to the choice of action; relevance. in turn. depends on the per- 
formance measure. Fully observable environments are convenient because the agent need 
not maintain any internal state to keep track of the world. An environment might be par- 


tially observable because of noisy and inaccurate sensors or because parts of the state are 





simply missing from the sensor data—for example. a vacuum agent with only a local dirt 
sensor cannot tell whether there is dirt in other squares. and an automated taxi cannot see 
what other drivers are thinking. If the agent has no sensors at all then the environment is 
unobservable. One might think that in such cases the agent's plight is hopeless. but the 
agent's goals may still be achievable. sometimes with certainty. 

Single agent vs. multiagent; The distinction between singl and multiagent envi- 
ronments may seem simple enough. For example. an EN ing a crossword puzzle by 

agen 


itself is clearly in a single-agent environment. where; t playing chess is in a two a- 





gent environment. There are, however. some s es. First, we have described how 

an entity may be viewed as an agent, Mv not explained which entities must be 

viewed as agents, Does an agent A (the/taxi er for example) have to treat an object B 

(another vehicle) as an agent, or ci reated merely as an object behaving according 

to the laws of physics, analogous ves at the beac ps s blowing in the wind? The 

key distinction is whether B V bblfavior is best M os laximizing a performance meas- 
o! 


ure whose value depen 








gent A's behavio ample. in chess, the opponent enti- 






ty B is trying to m nizd its performance + which. by the rules of chess, minimi- 


zes agent ay. Brice measure, Tab e 


In the taxi-d environment, on the other hand, avoiding collisions maximizes the per- 


as 





is a competitive multiagent environment, 


H 


formance measure of all agents, so it is a partially cooperative multiagent environment. It 
is also partially competitive because, for example, only one car can occupy a parking space. 
The agent-design problems in multiagent environments are often quite different from those 
in single-agent environments; for example. communication often emerges as a rational be- 
havior in multiagent environments; in some competitive environments. randomized behav- 
ior is rational because it avoids the pitfalls of predictability. 

Deterministic vs. stochastic: If the next state of the environment is completely deter- 
mined by the current state and the action executed by the agent. then we say the environ- 
ment is deterministic; otherwise. it is stochastic. In principle. an agent need not worry a- 
bout uncertainty in a fully observable. deterministic environment. (In our definition. we 
ignore uncertainty that arises purely from the actions of other agents in a multiagent envi- 
ronment; thus. a game can be deterministic even though each agent may be unable to pre- 


dict the actions of the others. ) If the environment is partially observable. however. then it 
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could appear to be stochastic. Most real situations are so complex that it is impossible to 
keep track of all the unobserved aspects; for practical purposes. they must be treated as 
stochastic. Taxi driving is clearly stochastic in this sense. because one can never predict 
the behavior of traffic exactly; moreover. one's tires blow out and one's engine seizes up 
without warning. The vacuum world as we described it is deterministic, but variations can 
include stochastic elements such as randomly appearing dirt and an unreliable suction mech- 
anism, We say an environment is uncertain if it is not fully observable or not deterministic. 
One final note: our use of the word "stochastic" generally implies that uncertainty about 
outcomes is quantified in terms of probabilities; a nondeterministic environment is one in 


which actions are characterized by their possible outcomes. but probabilities are atta- 






ched to them. Nondeterministic environment descriptions ar associated with per- 


formance measures that require the agent to succeed for ssible outcomes of its ac- 
tions. 

Episodic vs. sequential: In an episodic task onhient. the agent's experience is di- 
vided into atomic episodes. In each episode tl receives a percept and then performs a 
single action, Crucially, the next episo TAA depend on the actions taken in previous 


episodes. Many classification tasks ar&episodic. For example. an agent that has to spot de- 





fective parts on an assembly line Seach decision on Beacon part. regardless of pre- 

vious decisions; moreover. B rrent decision oon t whether the next part is de- 

fective. In sequential e ents, on the other ha the current decision could affect all 

future decisions. s and taxi driving SHR tial; in both cases, short-term actions 
一 

can have lon D sequences. Episode afvironments are much simpler than sequential 

WEE N use the agent does not al to think ahead. 

Static vs. dynamic: If the environment can change while an agent is deliberating, then 
we say the environment is dynamic for that agent; otherwise, it is static, Static environ- 
ments are easy to deal with because the agent need not keep looking at the world while it is 
deciding n an action. nor need it worry about the passage of time. Dynamic environments, 
on the other hand. are continuously asking the agent what it wants to do; if it hasn't decid- 
ed yet. hat counts as deciding to do nothing. If the environment itself does not change with 
the passage of time but the agent's performance score does. then we say the environment is 
semidynamic. Taxi driving is clearly dynamic: the other cars and the taxi itself keep mov- 


ing while the driving algorithm dithers about what to do next. Chess. when played with a 





clock. is semidynamic. Crossword puzzles are static. 
Discrete vs. continuous: The discrete/continuous distinction applies to the state of the 
environment, to the way time is handled. and to the percepts and actions of the agent. For 


example. the chess environment has a finite number of distinct states (excluding the 
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clock). Chess also has a discrete set of percepts and actions. Taxi driving is a continuous- 





state and continuous-time problem: the speed and location of the taxi and of the other vehi- 
cles sweep through a range of continuous values and do so smoothly over time. Taxi-driv- 
ing actions are also continuous (steering angles. etc. ). Input from digital cameras is dis- 
crete, strictly speaking. but is typically treated as representing continuously varying inten- 


sities and locations. 





Known vs. unknown: Strictly speaking. this distinction refers not to the environment 
itself but to the agent's Cor designer’s) state of knowledge about the “laws of physics" of 
the environment. In a known environment. the outcomes (or outcome probabilities if the 
environment is stochastic) for all actions are given. Obviously. if.the environment is un- 
known, the agent will have to learn how it works in order good decisions. Note 
that the distinction between known and unknown environjeKt* is not the same as the one 
between fully and partially observable € ite possible for a known envi- 
ronment to be partially observable—for examp Shlitaire^ card games, I know the 
rules but am still unable to see the cards tha a ot yet been turned over. Conversely, 
an unknown environment can be fully obse e—in a new video game. the screen may 


show the entire game state but I still now what the buttons do until I try them. 
New Words and Phrases JU ge 


agent n， 智 能 体 Zh- X Sion n. WA. mif 





dimension n. J NA randomize vt， 随 机 化 
detect vt. KON deterministic adj. 确定 性 的 
vacuum n. 真空 stochastic adj. 随机 的 
adj. Hay episodic adj. WEWERE; 偶尔 发 生 的 
multiagent n. 多 智能 体 semidynamic acj。 半 动态 的 
subtle adj， 微 妙 的 ， 敏 感 的 crossword n， 纵 横 填 字 字谜 
entity n， 实 体 ， 本 质 conversely adv. RZ. 相反 地 


analogous adj. 模拟 的 
opponent n. XF 
adj. 对 立 的 


Notes 


l. agent: 智能 体 ， 指 能 自主 活动 的 软件 或 者 硬件 实体 。 任 何 独立 的 能 够 思想 并 可 以 
同 环境 交互 的 实体 都 可 以 抽象 为 智能 体 。 
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2. solitaire: 一 个 著名 的 计算 机 纸牌 游戏 ， 需 要 将 两 个 环 状 牌 基 上 的 纸牌 一 个 按 从 小 
到 大 另 一 个 按 从 大 到 小 的 顺序 依次 排列 。 例 如 ，Windows 操作 系统 中 的 纸牌 大 战 。 


Exercises 


I. Translate the following phrases into Chinese. 


discrete distinction 
continuous distinction 


L 

2. 

3. sensor 

4. single agent «e 


multiagent 


5 S 
6. performance measure xv 
8 

9 


laws of physics 


. dynamitic environment ~ x 

. stochastic environment ^K 

10. vacuum agent S A 
Il. Fill in the blanks wir in the table. yah 














properties outcome eduction environment 
ANO T rationality .^ distinguish function 

















categorizati vacuum sensors define 
sequence ensure perfectu percept 
outcomes perceiving preferences engage 
1. Notice that the agent program is very small indeed compared to the corre- 


sponding table. The most obvious reduction comes from ignoring the percept history, 





which cuts down the number of possibilities from 4T to just 4. A further. small 
comes from the fact that when the current square is dirty, the action does not depend on 
the 

2. We need to be careful to between rationality and omniscience. An omnis- 
cient agent knows the actual of its actions and can act accordingly; but omnisci- 
ence is impossible in reality. 

3. How well an agent can behave depends on the nature of the ; some envi- 


ronments are more difficult than others. We give a crude of environments and 
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show how of an environment influence the design of suitable agents for that envi- 
ronment. 
4. An agent is anything that can be viewed as its environment through 


and acting upon that environment through actuators. A human agent has eyes. ears, and 
other organs for sensors and hands. legs. vocal tract. and so on for actuators. 


5. By specifying the agent's there is to say about the agent. Mathematically speaking. 


we say that an agent's behavior is described by the agent that maps any given per- 
cept to an action. 

6. If we success in terms of agent's opinion of its own performance, an a- 
gent could achieve perfect simply by deluding itself that its performance 


was ; 
7. Our definition of rationality does not require i then. because the ra- 
te; 


tional choice depends only on the NS e must also that 
we haven't inadvertently allowed the agent to decidedly underintelligent activi- 


ties. x e 
ai irst have between the different 
ces into awe A 


1. We have liste e_medical-diagnosisy t K single-agent because the disease 
process in a pai profitably mod: Fr Aly but a medical-diagnosis system 
ES 


8. To make such choices. an age 






possible of the various pla 


II. Translate the followin 


might also NY geal with recalcitrant ts and skeptical staff, so the environment 
m 





could have a agent aspect. Furthermore, medical diagnosis is episodic if one conceives 
of the task as selecting a diagnosis given a list of symptoms; the problem is sequential if the 
task can include proposing a series of tests. evaluating progress over the course of treat- 


ment, and so on. 


























2. Computer scientists are often faced with the task of comparing algorithms to see 


how fast they run or how much memory they require. There are two approaches to this 
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task. The first is benchmarking 一 running the algorithms on a computer and measuring 
speed in seconds and memory consumption in bytes. Ultimately. this is what really mat- 
ters, but a benchmark can be unsatisfactory because it is so specific; it measures the per- 
formance of a particular program written in a particular language. running on a particular 
computer, with a particular compiler and particular input data. From the single result that 
the benchmark provides. it can be difficult to predict how well the algorithm would do on a 
different compiler. computer, or data set. The second approach relies on a mathematical a- 


nalysis of algorithms. independently of the particular implementation and input. 


A 
& 
AF 
E 


Readi ing Under, Üscertaint 


Agents may Poe. to ES uncertain. Ne due to partial observability, nonde- 


terminism, Nj ation of the two ien may never know for certain what state 
PN 


it’s in or wh 


























will end up after a sequence of actions. 

We have seen problem-solving agents and logical agents designed to handle uncertainty 
by keeping track of a belief state—a representation of the set of all possible world states 
that it might be in—and generating a contingency plan that handles every possible eventual- 
ity that its sensors may report during execution. Despite its many virtues. however, this 
approach has significant drawbacks when taken literally as a recipe for creating agent pro- 
grams. 

(1) When interpreting partial sensor information, a logical agent must consider every 
logically possible explanation for the observations. no matter how unlikely. This leads to 
impossible large and complex belief-state representations. 

(2) A correct contingent plan that handles every eventuality can grow arbitrarily large 
and must consider arbitrarily unlikely contingencies. 

(3) Sometimes there is no plan that is guaranteed to achieve the goal—yet the agent 


must act. It must have some way to compare the merits of plans that are not guaranteed. 
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Suppose. for example. that an automated taxi has the goal of delivering a passenger to 
the airport on time. The agent forms a plan. As. that involves leaving home 90 minutes 
before the flight departs and driving at a reasonable speed. Even though the airport is only 
about 5 miles away. a logical taxi agent will not be able to conclude with certainty that 


“ 


Plan A» will get us to the airport in time. ” Instead, it reaches the weaker conclusion 
“Plan As, will get us to the airport in time, as long as the car doesn't break down or run 
out of gas. and I don't get into an accident. and there are no accidents on the bridge. and 
the plane doesn't leave early, and no meteorite hits the car, and..." None of these condi- 
tions can be deduced for sure. so the plan's success cannot be inferred. This is the qualifi- 
cation problem. for which we so far have seen no real solution. 


Nonetheless. in some senseAw is in fact the right thing,t What do we mean by 







this? We mean that out of all the plans that could be exec! 99 is expected to maximize 





the agent's performance measure (where the expectation is relative to the agent's knowl- 





edge about the environment). The performance. 


time for the flight, avoiding a long. Ms 


ing tickets along the way. The agent's 


sure/includes getting to the airport in 
it at the airport, and avoiding speed- 
e cannot guarantee any of these outcomes 


for Ag), but it can provide some degré elief that they will be achieved. 





Other plans. such asA ni crease the ire that it will get to the air- 
port on time, but also incre: e likelihood of a TE $! The right thing to do 一 the ra- 


tional decision—therefor&depends on both the iXe importance of various goals and the 









gree to which. theyssil achieved. The remainder of this section 


一 


hones these i 1 i ftopment of the general theories of uncertain 
reasoning and onal decisions that we present in this paper. 


Summarizing Uncertainty 


Let's consider an example of uncertain reasoning: diagnosing a dental patient's tooth- 
ache. Diagnosis—whether for medicine. automobile repair. or whatever—almost always 
involves uncertainty. Let us try to write rules for dental diagnosis using propositional log- 
ic. so that we can see how the logical approach breaks down. Consider the following simple 
rule: 

Toothache > Cavity 

The problem is that this rule is wrong. Not all patients with toothaches have cavities; 
some of them have gum disease. an abscess. or one of several other problems; 

Toothache > Cavity V Gum Problem V Absce: 


Unfortunately. in order to make the rule true. we have to add an almost unlimited list 





of possible problems. We could try turning the rule into a causal rule: 
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Cavity — Toothache 

But this rule is not right either; not all cavities cause pain. The only way to fix the 
rule is to make it logically exhaustive: to augment the left-hand side with all the qualifica- 
tions required for a cavity to cause a toothache. Trying to use logic to cope with a domain 
like medical diagnosis thus fails for three main reasons. 

(1) Laziness; It is LAZINESS too much work to list the complete set of antecedents 
or consequents needed to ensure an exceptionless rule and too hard to use such rules. 

(2) Theoretical ignorance: Medical science has no complete theory for the domain. 

(3) Practical ignorance; Even if we know all the rules, we might be uncertain about a 
particular patient because not all the necessary tests have been or gan be run. 


The connection between toothaches and cavities is just jcal consequence in ei- 







ther direction. This is typical of the medical domain, as smost other judgmental do- 


mains: law, business, design, automobile repair. xe + dating. and so on. The 
agent's knowledge an at best provide only a de; ef in the relevant sentences, Our 
f 


main tool for PROBABILITY dealing with, roc belief is probability theory. In the 


terminology of latter parts. the ontologi¢al c itments of logic and probability theory are 
the same—that the world is composed,o! hat do or do not hold in any particular case—but 
the epistemological commitments a ent; a logical weap each sentence to be true or 
false or has no opinion, wher probabilistic agent ave’ a numerical degree of belief be- 
tween 0 (for sentences t| re certainly false) and, 1-(Cektainly true). 

Probability Re way of summanizing\thé uncertainty that comes from our lazi- 
ness and ign ce, Hereby solving the q ation problem. We might not know for sure 


what afflicts a Da¥ticular patient, but we believe that there is. say. an 80% chance—that 
is. a probability of 0. 8—that the patient who has a toothache has a cavity. That is, we ex- 
pect that out of all the situations that are indistinguishable from the current situation as far 
as our knowledge goes. the patient will have a cavity in 80% of them. This belief could be 
derived from statistical data—80% of the toothache patients seen so far have had cavities— 
or from some general dental knowledge. or from a combination of evidence sources. 

One confusing point is that at the time of our diagnosis. there is no uncertainty in the 
actual world: the patient either has a cavity or doesn't. So what does it mean to say the 
probability of a cavity is 0. 8? Shouldn't it be either 0 or 1? The answer is that probability 
statements are made with respect to a knowledge state. not with respect to the real world. 
We say “The probability that the patient has a cavity. given that she has a toothache, is 
0. 8. " If we later learn that the patient has a history of gum disease. we can make a differ- 
ent statement; “The probability that the patient has a cavity. given that she has a 


toothache and a history of gum disease. is 0. 4. " If we gather further conclusive evidence a- 








gainst a cavity. we can say "The probability that the patient has a cavity. given all we now 
know. is almost 0. " Note that these statements do not contradict each other; each is a sep- 


arate assertion about a different knowledge state. 


New Words and Phrases 


nondeterminism n， 不 确定 性 主义 augment vt， 增 强 ， 增 加 

contingency n. 偶发 事件 ; 可 能 性 ， 偶 然 性 n. 增加， 补充 物 

recipe n. FIK domain n， 论 域 

hone n. A. E antecedent n， 前 项 

rational n， 有 理 数 ad y. 8. Jeff 
diagnose vt. Wr exceptionles 不 除外 的 ,无 例外 的 
diagnosis n， 诊 断 ， 判 断 BA UM 承诺 ， 许 诺 

propositional adj. 命题 的 i ological adj. 认识论 的 ， 知 识 论 上 的 
cavity n， 腔 ,空洞 ， 洞 ~ NO vt. & vi. 得 到 ， 导 出 ,来 自 

gum disease 牙 周 病 ^K 


abscess n. 脓肿 S x 
Exercises J oan 


A 
A 
Is ANS llowing ES 


1. What i$ an agent? 
2. Why do we need acting under uncertainty? 


3. How to summarize uncertainty? 


]. Translate the following sentences into Chinese. 


l. Probability provides a way of summarizing the uncertainty that comes from our la- 


ziness and ignorance. thereby solving the qualification problem. 








2. Agents may need to handle uncertainty, whether due to partial observability, non- 
determinism. or a combination of the two. An agent may never know for certain what state 


it's in or where it will end up after a sequence of actions. 
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3. When interpreting partial sensor information. a logical agent must consider every 











logically possible explanation for the observations. no matter how unlikely. This leads to 


impossible large and complex belief-state representations. 











Abstract Reading «e 


An Event-based Distributed Diagnosis Framewo) "S tructural Model Decomposition 


Complex engineering systems require Ro mline fault diagnosis methodologies to 
improve safety and reduce maintenance aN t aa diagnosis approaches are cen- 
tralized. but these solutions do not sca . Also. centralized diagnosis solutions are dif- 





ficult to implement on incre: 


valent i am embedded systems. 
rov Íramewi er sical systems with continuous 
behavior. Using j^ iA ti a structur. A ecomposition method from the Ar- 
tificial Intelligenc del ased diagnosis ¢ Suns we develop a distributed diag- 
noser design SOs" to build local Bs ge diagnosers. These diagnosers are con- 
structed ws global diagnosability analysis of the system. enabling them to generate 


local diagnosis results that are globally correct without the use of a centralized coordinator. 





This paper presents a distri 


We also use Possible Conflicts to design local parameter estimators that are integrated with 
the local diagnosers to form a comprehensive distributed diagnosis framework. Hence. this 
is a fully distributed approach to fault detection. isolation, and identification. We evaluate 
the developed scheme on a four-wheeled rover for different design scenarios to show the ad- 


vantages of using Possible Conflicts, and generate on-line diagnosis results in simulation to 





demonstrate the approach, 
On the Revision of Informant Credibility Orders 


In this paper we propose an approach to multi-source belief revision where the trust or 
credibility assigned to informant agents can be revised. In our proposal. the credibility of 
each informant represented as a strict partial order among informant agents. will be main- 


tained in a repository called credibility base. Upon arrival of new information concerning 









| UNIT 4 ARTIFICIAL INTELLIGENCE 


the credibility of its peers. an agent will be capable of revising this strict partial order. 





changing the trust assigned to its peers accordingly. Our goal is to formalize a set of change 
operators over the credibility base: expansion. contraction. prioritized. and non-prioritized 
revision, These operators will provide the capability of dynamically modifying the credibili- 
ty of informants considering the reliability of the information. This dynamics will reflect a 
new perception of trust assigned to the informant, or extend the set of informants by ad- 


mitting the addition of new informant agents. 


How Much Does It Help to Know What She Knows You Know? An Agent-based Simula- 


able mental content such as beliefs, desires, and inten, thers. Humans are known 


tion Study És 
In everyday life, people make use of theory of mind a tly attributing unobserv- 
oi 


to be able to use this ability recursively. That is, ASM in higher-order theory of 


mind, and consider what others believe about t n OR In this paper. we use a- 






E! 


gent-based computational models to inves the evolution of higher-order theory of 


mind. We consider higher-order the ihd across four different competitive games. 


including repeated single-shot MA ed extensive fo: ames, and determine the ad- 
vantage of higher-order ee ifid agents over t er-order theory of mind oppo- 
ames 


nents. Across these fo we find a common\ngt oth in which first-order and second- 
order theory of mindagentts clearly outperfori rw that are more limited in their a- 
bility to make use ory of mind. wl dvantage for deeper recursion to third-or- 
der theory o! Nd limited in comparison! 
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DATA STRUCTURES 





Text: Contiguous Representati 


NS 


The obvious way to represent an array in is to store its elements in a table. 


Arrays 


that is, in consecutive cells in memory. Fo mple. consider an array X consisting of six 





elements X [1] through X [6]. where i N=? for each i, Figure 5. 1 shows a contigu- 
ous representation of X starting atpiemoty address X. re it is assumed that each inte- 
ger occupies four memory D he i^ element iris at address X+4(i—1). In 


general, if X is the addxes e first cell in mem an array with indices /.. u, and if 


each element d hen the i" element Qo starting at address X+-L(i—1) and 
can be retrieved i sum: time. x 


Ds 


/ 
ANC X o4 X sh X +12 X +16 X +20 
4 | 9 | 16 | 25 | 36 


xp] xp] XB) xi xp) xil 

















Figure 5.1 A one-dimensional array represented as a table in contiguous storage. The address of 

the beginning of the array is X; each element occupies four memory locations, The index set of this 

array is 1.**.6 and X [i] =? for each i. 

What about iteration? It would be possible to iterate over the elements ofX by access- 
ing X [/]. then X [7-1]. and so forth up to X [u], thus performing the address calcula- 
tion Length(X) times. A better method is to start with X (which is the address of X [/]) 





and proceed from element to element by adding L on each iteration. Although this im- 
provement reduces the amount of arithmetic that is performed, the overhead." is still linear 
in the length of the array. Of course. L. /. and u must be available somewhere in order to 


carry out these calculations. They can be stored in several places. 
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(1) The valuesL. /. and u can be stored starting at address X. The formula for the 
address of X [i] must then be adjusted slightly to account for the extra space used. 

(2) In strongly typed languages. some or all ofL. /. and u may be part of the defini- 
tion of X and may be stored elsewhere. Furthermore. if the language does not permit arbi- 
trary lower bounds in indexing then the value I is fixed and need not be stored anywhere. 

(3) A sentinel value can be stored just after the last element of the array. That is. 
memory address X+L (u—/+1) can contain some bit pattern that never occurs in the first 
word of the memory representation of any element of V. Now u need not be explicitly 
stored at all and iterations are terminated by detecting the sentinel value. A disadvantage of 
this method is that an iteration is required even to find the length of such an array. Never- 


theless, this representation is often used when L and / are fixé he programming lan- 







guage C. for example. represents character strings in this 
Storage in contiguous memory is less rss elements of the array have 
different lengths. because the i^ element cannot in constant time by simple arith- 


metic. To handle this situation we can WAR ents in memory anywhere and keep a 
W: 








table of pointers to the elements. N zu s an example of such an array whose ele- 
ments in order are the integer 9, s BCD, and an array of two integers. (The lat- 


ter two arrays are stored in s ti memory. not as Po pointers, ) The address of 
e 


the i element is now storedjji]lócation AEP, E^ P is the size of a pointer in 
memory. The a this implementation’ Re two: an extra pointer must be fol- 
lowed to MP AS s. and an array c uses p X n extra bits to store pointers 

needed to store t ia. But a major advantage is that single point- 


in addition t NM 
er manipulat EN ffice to move elements within the array; for this reason. tables of point- 


ers are often used when the elements are large (even if they are all of the same size). 





Figure 5.2 A three-element array implemented as a table of pointers, 


A two-dimensional array whose elements all have the same size can also be represented 
efficiently in contiguous storage; the only problem is to determine the order in which the 


elements should be placed. The two most common schemes are row major order. in which 








the rows are placed one after another in memory. and column major order. in which the 
columns are placed one after another. For example. consider the following two arrays. 
each of which has indices (1…4) X (1*5) : 


1 2 3 4 5 i S 9 3:7 
6 7 8 9 10 2 6 10 14 18 
= C= G-D 
11 12 13 14 15 3 7 11 15 19 
16 17 18 19 20 4 8 12 16 20 


The entries in arrayR suggest the order in which the elements of R are stored in row 
major order. First comes R [1. 1]. then R [ 1. 2]. and so forth up to R [1. 5] which is 
followed by R [2, 1]. In general. entry R [i, j] is stored in memory at address R--L. (5 


(i-1)+G —1)), where as usual each element requires spage the first element be- 







gins at address R. (The subtractions here reflect the fact is the first integer in each 


interval indexing R. ) The entries in C suggest ASA rder. Again element C [1. 


1] is first in memory. but it is followed by C [25 RD 1]. C [4. 1] and then C [1. 
p begins at address C+L. (4(j—1) 


+(i—1)). If particular iterations =e 一 for example. if row-by-row iteration is 


2]. If C is stored in column major order. SR 
more frequent than column-by-column bn 一 then one of these layouts may be more ad- 


vantageous than the MM. 


Row and column vis in er can be ys her dimensions. Let X be a 


general d-dimensional a Ae ith indices (4; *** tX (utu). When X is stored in 
row major order irs? element is XU 4 followed by X[4 stsl], X[h s. 
Li; ]* and MV. io X[A . laa Ext which the next element is X[ 4; s+. La + 


1,4]. "iw aways are represented in row major order we often say simply that "the last 
index varies fastest" as we examine successive elements in memory; each index is incremented on- 
ly after all subsequent indices reach their upper bounds. Similarly. to store X in column major or- 
der we store X[ 4 ,la ,°° sla], XLI 4- 1.1 «777. ]. and so forth up to X[ui 44 «7774; J. and the 
next element isX[/ ,2 + 1.45 ,4], so that it is the first index that “varies fastest”. 

Now suppose X is an arbitrary d-dimensional array as in the previous paragraph. that 
X is stored in row major order starting at address X. and that each element of X occupies 
space L. For arbitrary indices jı » j2 ***.j4* where in memory is element X[ ji s jz +***+jy_] lo- 
cated? (Of course. the answer is "nowhere" unless /, <j, < u, for each k. Verifying this 
condition is called range checking. Not all languages perform range checking; in some, it 
can be turned on for debugging and turned off when efficiency is important. ) 

For each k = 0.1.**:.d. define M; = L, 


dimension of X . M, is the amount of memory required to store each d—fdimensional “sub- 


where s; = u; — l; +1 is the size of the i^ 


uaa 


array” of X in which the first & indices are fixed; for example. M, is the number of memory 











locations from the start of element X[/; ./; .***.1;] to the end of element X[4 slz stsl. 
Up ego ett sty]. In particular. M; = L and M, is the size of the entire array X. There- 
fore, there are M, + (jı —1,) cells from X to the beginning of element X[ ji slz stla]. 
From that point, there are M, + (jə —1,) cells to the beginning of element X[ ji .j; slz , 
1, |, Continuing in this way, we find that element X[j; ,j» ,ja | is located at address 

X--M Gi — 1) - Mi Gs — L2 H-- +My Ga — (5-2) 


To make the Access operation as fast as possible, the values M; should be computed in 








advance. once and for all. Moreover. we should compute and save the single constant value 


X, = Mil; ++ Mla, since then we can write expression (1) as 











X— Xo t MjictMjc:- ye (5-3) 
which is faster to evaluate. requiring only approximatel, ad rios rather than 3d 
operations. Note that once we have the M, and Xs, the X re unnecessary for Access 


unless we wish to perform range checking. 


There is an independent context in which t SM useful; as mentioned before, 
row major representation of X is Bde... V te when we desire to iterate over the 
elements of X with the last index varyi Y . Suppose that we wish to have a version 


of the Iterate operation with the indice&«Manging in some other order. Any such iteration 





can be implemented efficiently, usi e fact that the se in memory between X[7;. 
je sj and XLji s g4 ja] is exactl 

When the elements fa ica re of different sizes in memory, we 
can extend the sch igure 5. 2 by st sure to the elements rather than the ele- 
ments thems n Soa is equal to rx a pointer. and a pointer must be followed 


after the address Calculation. 





All of the methods so far considered for representing multidimensional arrays permit 
access to any element of the array in constant time. There is a subtle point here. You may 
feel that access to an element of a multidimensional array cannot be performed in constant 
time, since the number of arithmetic operations depends on the number of dimensions, But 
the "size" of an array is the total number of its elements; the cost of accessing any element- 
of a d-dimensional array is independent of the number of elements in the array, although it 
does depend on d. This convention reflects the fact that the arrays used in computer pro- 
grams typically have a fixed number of dimensions. although they may have more or fewer 
elements depending on the problem size. Indeed. few languages support arrays in which d 


is not fixed for each given array. 
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New Words and Phrases 


consecutive adj. 连续 的 ， 连 贯 的 suffice vi. WB; 有 能 力 
contiguous adj. 紧密 连接 的 vt， 满 足 …… 的 需要 ; 使 满足 
indices n. index 的 复数 ; 索引 iteration n. 3E fX 
retrieve ut. RÆ debug vt. 排除 故障 ， 程 序 调试 命令 
linear adj. 线性 的 ,一 次 的 subarray n， 子 数组 
sentinel n， 哨 兵 iterate vt. HAZ; 反复 申明 
terminate vt. & vi. 结束 , 使 终结 convention n. “je, 惯例 
manipulations n， 操 作 ， 操 纵 increment n. 长 
Notes gt 

1. overhead; 额外 开销 。 除 ce ， 还 有 很 多 其 他 信息 ， 这 些 信息 用 来 保证 
数据 结构 。 这 些 信息 被 称 作 和 额外 开 

2. row and column "M or odorem y. 


CEN. 


Exercises 
E Ww 
Te ee phrases Xe 


. one-dimensional array 

. sentinel value 

. three-element array 

. contiguous memory 

. row-by-row iteration 

. column-by-column iteration 


. multidimensional array 


oo -3 OQ Cc p pmo 


. the programming language C 











I. Fill in the blanks with the missing word(s) from the table below. 





























applications representation scattering median 
successively additional contiguous value 
element string remaining binary 
consists drawbacks linear subtree 
volumes deleted compressed associated 
1. One of the of representing arrays in ____—s memory is the time re- 
quired to initialize them; the obvious method of — le element to its initial 
value uses time proportional to the number of elements. SN 
2. The contiguous methods of the prec: D tion allocate storage for every 
of an array. But in many the arya aR we consideration are only partial- 







ly filled. Sometimes only a of the elen f an array have useful values. 


~ 


3. Decoding is almost the same as ei "irst of all, the representation 






simply of a sequence of cod. r$; it is easy to retrieve them one by one since 











the length in bits of a single cod r is fixed. 

4. Retrieval of orm n large text AX yis broad and important prob- 
lem-numerous techniqués een developed, ania . . . . written on the topic. 
One simple Lc is hing. 

5. A bina dm tree is a binary ving a associated with each node, 
such that thi s have a order! and at each node the value is greater than any 


value in the left subtree and less than any value in the right 
6. When a node is from a binary search tree, the inorder traversal of the 
nodes must yield the keys in the same order they had before the deletion. 
7. Ina split tree each node contains two keys. One is called the node value; 
any information in the node (the Info field) is associated with this key. 
8. The data structure is a pure search tree—the nodes have no balance. color. or oth- 
er auxiliary fields, only left and right child pointers and fields for the key itself and any 
data. 


II. Translate the following paragraphs into Chinese. 


1. A greedy algorithm is based on the following simple principle: when called on to 
make a sequence of choices to develop a solution to a problem, always make the choice that 


has the lowest immediately visible cost; don't try to look ahead to see if that choice might 
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turn out to be more costly in the long run. The first thing to note about greedy algorithms 


is that. for most problems (as in most real-life situations), they do not work. 

















2. Algorithms that solve very different problems sometimes bear a strong family re- 
semblance to each other. For example. the general strategy of Divide-and-Conquer under- 


lies Binary Search, Merge Sort, and the clever integer multiplication algorithm, as well as 


and-Conquer in mind when faced with new problems to s 


examples of several other general strategies for the XL 


Readin ramming a, ngineering Activity 
on X5 


n this section we will give 


useful algorithms for many other problems. It is worth keepj eneral idea of Divide- 
g 


lgorithms. 

















/ 

A gis a solution to a problem. The problem might be very specific and well-de- 
fined, for example. to calculate the square roots of the integers from 1 to 100 to ten deci- 
mal places. Or the problem might be vast and vague. for example. to develop a system for 
printing books by computer. Large. ill defined problems are. however. best solved by 
breaking them down into smaller and more specific problems. As a part of the problem of 
printing books by computer. for example. we might need to determine the places where a 
word could be hyphenated if it had to be split across two lines. Our subject matter is pro- 
gramming problems that are specific enough that we can describe them in a few words and 
can judge readily what is a solution and what isn't. but are common enough that they come 
up over and over again in the solution of larger programming problems. 

Even for problems that can be described very exactly in a few words. however. there 
can be many possible solutions. Of course one can always get different programs by chan- 
ging variable names. translating from FORTRAN to Pascal. and the like. But there can be 


solutions that differ in more fundamental ways. that use quite different approaches or 











methods to solve a problem. Consider. for example. the problem of finding a word K in a 
sorted table of words. 

Here are three approaches. 

A. Start at the beginning of the table and go through it, comparing K to each word in 
the table, until you find K or reach the end of the table. Of course that way 
doesn't take advantage of the fact that the table is sorted. 

Here's a slightly more intelligent variation. 

B. Start at the beginning of the table and go through it as in (A), stopping when you 
find K or another word that should come after K in the table. or when you reach 
the end of the table. 

Changing the stopping condition in this way eliminates necessary work done 
by method (A). If we're looking for a ardvark, for exam} nces are we won't have to 
look long if we use method (B). But there is a bett 

C. Start in the middle of the table. If K is 
Otherwise. decide by looking at Ehe 






idNle word in the table. you're done. 


word whether K would be in the first 


half of the table or the second, A CANA at the same process on one half of the ta- 
ble. On subsequent MA Ch a quarter. an eighth，…of the table in the 
same way. Stop "S K or have P d the size of the table 
you're searching. zv 


Method (C) is zm search and WE. fastest of the three. (It's also 
"T" 1 


the trickiest to Mis rrectly. Actuall: lescription leaves out a lot of important 
details; for A hich element is in iddle" of a table of length 10?) We'll get to 
ouUnro 


a detailed acc 





Í binary search in the previous papers. but for now there are a few mor- 
als to be drawn from the example. First. CA). (B). and (C) are different algorithms for 
the same problem. None of them is a program, since the language used to describe them 
isn't a programming language. But any programmer would understand these descriptions. 
and would understand that FORTRAN and Pascal implementations of (C) embody the 
same algorithm. whereas Pascal implementations of (A) and (C) embody utterly different 
algorithms. 

An algorithm is a computational method used for solving a problem. The goals of this 
book are to teach you some of the most important algorithms for solving problems that 
come up over and over again in computer programming. and to teach you how to decide 
which algorithm to use when you have a choice (as you almost always do). 

We might choose one algorithm over another because it is always faster. or because it 
is usually faster. or because it uses less memory. Or we might choose an algorithm because 


it is easier to program. or because it is more general and we want to anticipate the possibili- 
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ty that the problem we are solving might change in the future. For our purposes in this 
book. however. we will mostly be looking at the speed of algorithms. and how much mem- 
ory they use. 

Of course we are not going to determine the speed of an algorithm by writing a pro- 
gram and then timing it. The numbers obtained in this way would depend too much on the 
quality of the programmer and the speed of the particular computer to be of general interest 
or applicability. Instead. we'll try to think in more abstract. mathematical terms. If the 
table has length n, then method (A) takes time proportional to n; double the size of the ta- 
ble and the algorithm will take roughly twice as long. Method (C). on the other hand. 
takes time proportional to the base 2 logarithm of n at worst (singe that is the number of 
times you can divide a table of length n in half before it is re a single element). 

We'll spend a good deal of time on this business of al analysis, but again a few 
simple morals will suffice for now. We want to use M ical tools for analyzing the al- 
gorithms we consider. since the right mathemati will give us conclusions that hold 
for all implementations. z^ 

To develop those mathematical tools ave to come up with mathematical models 
for the situations we are trying to tid. For example. to conclude that method (A) 
takes time proportional to the Jent o$ the table, we aime only that it always takes 
the same amount of time to Eédffom any element ae e to the next. That is true for 


a great many ways of i enting tables. so weak assumption we can draw a con- 


clusion of quite genéigl a plicability. 

Progra g Y engineering activ Tt isn't pure science. or pure mathematics ei- 
ther; when we)y¥ite programs. we can't ignore annoying details of practical importance. 
and we're not working in an environment where there's only one right answer. Engineers 
make design decisions based on an understanding of the consequences of alternative 
choices. That understanding comes from a knowledge of laws. usually stated in mathemat- 
ical terms, that cover a broad variety of situations, An engineer decides what kind of 
bridge to build to span a river at a particular spot by sizing up the parameters of the situa- 
tion Chow long? how much weight to be borne?) and applying the general laws that charac- 
terize the behavior of various kinds of bridges. An engineer will also bring to bear the wis- 
dom of experience accumulated by witnessing the construction of the things that have been 
designed. Programmers should think the same way; they need both an understanding of 
the general laws that govern the performance of algorithms. and the practical wisdom that 


comes from having attempted to implement them. 
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New Words and Phrases 


square roots n. 平方 根 aardvark n. 非洲 食 蚁 兽 
vague adj. 模糊 的 proportional adj. 成 比例 的 
hyphenate vt， 以 连 字 符号 连接 witness vi. 目击 ， 见 证 
Exercises 


I. Answer the following questions. 


1. How to find a word K in a sorted table of ws 

2. Why programming is an engineering acti 

3. Why algorithm is important to S, ? 

Il. Translate the following sent io'Chinese. 

l. A program is a solutio; ilem The iei be very specific and well- 
defined-for example. to init square roots b integers from 1 to 100 to ten deci- 
mal places. Or the Nai be vast and or example. to develop a system for 
printing books De T. x S $ 

ANA Lx 


2. Asa part of the problem of printing books by computer. for example. we might 














need to determine the places where a word could be hyphenated if it had to be split across 
two lines. Our subject matter is programming problems that are specific enough that we 
can describe them in a few words and can judge readily what is a solution and what isn't. 
but are common enough that they come up over and over again in the solution of larger pro- 


gramming problems. 


























3. Programming is an engineering activity. It isn't pure science. or pure mathematics 
either; when we write programs. we can't ignore annoying details of practical importance, 


and we're not working in an environment where there's only one right answer. 














Abstract Reading e 
Finding Dominators via Disjoint Set Union a" 


The problem of finding dominators in a, di; graph has many important applica- 







= 


tions, notably in global optimization of c code. Although linear and near-linear- 


time algorithms exist. they use sophi ata structures, We develop an algorithm for 


finding dominators that uses onl static tree" disjointyset data structure in addition to 
simple lists and maps. The al&drithn runs in near-li he time. depending on the 
implementation of the disj , 


et data structure. ive several versions of the algo- 





rithm, including one t nputes loop er rmation (needed in many kinds of 
global code opti Kin) and that can ka Self-certifying. so that the correctness of 
the computed inators is very easy to véfrfy. 


Strict Fibonacci Heaps 


We present the first pointer-based heap implementation with time bounds matching 
those of Fibonacci heaps in the worst case. We support make-heap. insert, find-min, meld 
and decrease-key in worst-case O(1) time. and delete and delete-min in worst-case O(lg n) 
time, where n is the size of the heap. The data structure uses linear space. 

A previous. very complicated. solution achieving the same time bounds in the RAM 
model made essential use of arrays and extensive use of redundant counter schemes to 
maintain balance. Our solution uses neither. Our key simplification is to discard the struc- 
ture of the smaller heap when doing a meld. We use the pigeonhole principle in place of the 


redundant counter mechanism. 
Deletion without Rebalancing in Balanced Binary Trees 


We address the vexing issue of deletions in balanced trees. Rebalancing after a deletion 
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is generally more complicated than rebalancing after an insertion。 Textbooks neglect dele- 





tion rebalancing. and many database systems do not do it. We describe a relaxation of 
AVL trees in which rebalancing is done after insertions but not after deletions. yet access 
time remains logarithmic in the number of insertions. For many applications of balanced 
trees, our structure offers performance competitive with that of classical balanced trees. 
With the addition of periodic rebuilding. the performance of our structure is theoretically 
superior to that of many if not all classic balanced tree structures. Our structure needs O 
Cog log m) bits of balance information per node. where m is the number of insertions, or 
O(log log n) with periodic rebuilding. where n is the number of nodes. An insertion takes 
up to two rotations and O(1) amortized time. Using an analysis that relies on an exponen- 


tial potential function, we show that rebalancing steps occur equency that is expo- 


nentially small in the height of the affected node. 


x 
ai S 
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INFORMATION SECURITY 


Text: Psychological Securi raps 


NF 


During my career of attacking software an ilities they power. many colleagues 





have remarked that I have a somewhat d approach. I tended to be surprised to 


hear this, as the approach seemed logi straightforward to me. In contrast, I felt that 






academic approaches were too ahs 


o realize wide spé¢ess in real-world applications. 
These more conventional ai S vere taking an almós mpletely random tack with no 
focus or. on the opposite/en the spectrum, sp 





hundreds of hours reverse-engi- 


neering and tracing-Applidations to ( hopefull igcover their vulnerabilities before they 
were Ma € the field. RX 
Now. pleag&do not take this the wrong way. I'm not condemning the aforementioned 


techniques. In fact I agree they are critical tools in the art of vulnerability discovery and ex- 
ploitation. However. I believe in applying some shortcuts and alternative views to enve- 
lope, enhance. and—sometimes—by pass these approaches. 

In this article I'll talk about some of these alternative views and how they can help us 
get inside the mind of the developer whose code or system we engage as security profes- 
sionals. Why might you want to get inside the mind of the developer? There are many rea- 
sons, but for this article we will focus on various constraints that are imposed on the crea- 
tion of code and the people who write it. These issues often result in suboptimal systems 
Írom the security viewpoint. and by understanding some of the environmental. psychologi- 
cal. and philosophical frameworks in which the coding is done. we can shine a spotlight on 
which areas of a systemare more likely to contain vulnerabilities that attackers can exploit. 
Where appropriate, I'll share anecdotes to provide examples of the mindset issue at hand. 


My focus for the past several years has been on large-scale environments such as major 
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corporations, government agencies and their various enclaves, and even nation states. 
While many of the elements are applicable to smaller environments. and even to individu- 
als. I like to show the issues in larger terms to offer a broader social picture. Of course, 
painting with such a broad brush requires generalizations, and you may be able to find in- 
stances that contradict the examples. 

The goal here is not to highlight particular technologies, but rather to talk about some 
environmental and psychological situations that caused weak security to come into being. It 
is important to consider the external influences and restrictions placed on the implementers 
of a technology, in order to best understand where weaknesses will logically be introduced. 
While this is an enjoyable mental game to play on the offensive sidg of the coin, it takes on 
new dimensions when the defenders also play the game and nt errors that would 
otherwise lead to attacks or b) use these same techniques me the attackers and how 
they operate!!! 

At this point, the security game becomes sider beautiful. The mindsets I'll 
cover fall into the categories of learned help sness/and naïveté, confirmation traps, and 
functional fixation. This is not an exhaxs. K Xt of influencing factors in security design 
and implementation, but a starting. 9 encourage further awareness of the potential 


ate or depend oj 
> 
Learned Helplessn iveté Wt 
Sociologists aie have MAT phenomenon in both humans and oth- 
: ey 


all learned helpleś It springs from repeated frustration when 


security dangers in systems that 





er animals t 

trying to MN ne's goals or rescue oneself from a bad situation. Ultimately, the anima 

subjected to this extremely destructive treatment stops trying. Even when chances to do 

well or escape come along, the animal remains passive and fails to take advantage of them. 
To illustrate that even sophisticated and rational software engineers are subject to this 

debilitating flaw, I'll use an example where poor security can be traced back to the roots o 

backward compatibility. Backward compatibility is a perennial problem for existing tech- 


nology deployments. New technologies are discovered and need to be deployed that are in- 





compatible with. or at the very least substantially different from. existing solutions. 





At each point in a system's evolution. vendors need to determine whether they wil 
forcibly end-of-life the existing solutions. provide a migration path. or devise a way to 
allow both the legacy and modern solutions to interact in perpetuity. All of these deci- 
sions have numerous ramifications from both business and technology perspectives. 
But the decision is usually driven by business desires and comes down as a decree to the 


developers and engineers. When this happens. the people responsible for creating the 
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actual implementation will have the impression that the decision has already been made 
and that they just have to live with it. No further reevaluation or double guessing need 
take place. 

Tmagine that the decision was made to maintaincompatibility with the legacy technolo- 
gy in its replacement. Management further decrees that no further development or support 
work will take place on the legacy solution. in order to encourage existing customers to mi- 
grate to the replacement. 

Although such decisions place burdens on the development in many ways 一 with secur- 
ity implications—they are particularly interesting when one solution. usually the new tech- 
nology, is more secure than the other. In fact, new technologies,are often developed ex- 
plicitly to meet the need for greater security—and yet the old,t gy must still be sup- 
ported. What security problems arise in such situations? « 

There are different ways to achieve backward e ility, some more secure than 
others. But once the developers understand thatktie Dider. less secure technology is al- 
lowed to live on, solutions that would ease the;riXfere often not considered at all. The fo- 
cus is placed on the new technology. and’ th cy technology is glued into it Cor vice ver- 


sa) with minimal attention to the X ects. After all, the team that is implementing 


the new technology usually didn' op the legacy coque goal is to ultimately sup- 
plant the legacy solution an 5 right? 

The most direct Ma is to compromisest es and security strength of the 
new technology sd hat of the legacyzsolttisfi. in essence allowing both the modern 
and legacy te: logf?to be active simult Sly. Learned helplessness enters when devel- 
opers can't DNC; that anything could be done—or worse, even should be done—to miti- 
gate the vulnerabilities of the legacy code. The legacy code was forced on them, it is not 
perceived to be their bailiwick (even if it impacts the security of the new technology by re- 
ducing it to the level of the old). and they feel they are powerless to do anything about it 


anyway due to corporate decree, 


New Words and Phrases 








spectrum n. jf enclave 7。 其 他 领域 ; 飞 地 ( 指 在 本 国境 内 
aforementioned adj. FIÈ 的 隶属 另 一 国 的 一 块 领土 ) 

suboptimal adj. 次 优 的 contradict vt. 与 …… 了 矛盾 

spotlight n. RE implementer ». 实施 者 

system n. 系统 naiveté adj. 天真 的 


anecdote 7。 轶 事 springs vi. 使 …… Jr 











perennial adj. 多 年 的 , 长久 的 implication n. 影响 


incompatible adj. 不 匹配 的 ,不 相 容 的 explicitly adv. 明确 地 
end-of-life n. 报废 minimal adj. 最 小 的 ， 极 少 的 ; 极 小 的 
perpetuity n. 永久 supplant vi. 把 …… 排 挤 掉 ,取代 ; 代替 
ramification n. 结果 ,分 支 robustness n. 坚固 性 ， 健 壮 性 ; SHEHE 
compatibility n. 兼容 性 bailiwick n. 工作 区 间 ; 辖区 
legacy technology 传统 技术 decree n. ik4. 命令 ; (法 院 的) 判决 ， 裁 定 
Notes 

1, While this is an enjoyable mental game to play NAME side of the coin, it 
takes on new dimensions when the defenders also vos e and a) prevent errors that 
would otherwise lead to attacks or b) use these & teshniques to game the attackers and 


how they operate. * 
MAE BG A C BG IEA AT RE AIRS woe TAAR RAAB 
击 的 错误 或 者 运用 这 些 相同 的 技巧 ; 击 者 及 他 们 的 操作 时 ， 这 个 游戏 就 会 上 升 到 一 


mE E XP 
Exercises á X- V 


A DA 
Ts pedoriate the following words*and phrases into Chinese. 


random tack 
reverse-engineering 
tracing application 
functional fixation 
confirmation trap 
debilitating flaw 
migration path 


legacy technology 


£e 9 GEE Oe S» oe cnr de pr 


backward compatibility 
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I. Fill in the blanks with the missing word(s) from the table below. 





























hash compatibility valid capability 
negative confirmation agency multiple 
alternative embodied refute password 
proximity notion ensuring fixation 
corporate inability solution permanently 
l. In short. the problems of the new modern security sprang from the wea- 
ker LANMAN of the legacy system and thus € a security profile to 
its lowest common denominator. It wasn't until much laters & er much secu- 


rity publicity. that Microsoft introduced the ng only one hash or the oth- 


Se 
er, and not both by default—and even later that t aS storing both LANMAN and 


NT hashes in to each other on loca] s; s: 
mss : + : I 
2. The issue from a security stan omes how to accomplish this backward 


without degrading the sec 






the new systems. Microsoft's naive solution 

pretty much the worst sibilities: it st the insecure together 

with the more secure one. H e benefit of the AS transmitted the representa- 
even w] eec 

9r uw s gence analyst k a three-letter . The analyst 


tions of both hashes ovégthe‘network, e 





wants to create pse ph reports in order t éss up the career ladder. The analyst culls 
information sources. includihg the previous reports of analysts in her posi- 
tion. 

4. Functional fixation is the to see uses for something beyond the use com- 


monly presented for it. Thi similar to the of first impressions—that the first 





spin applied to initial information disclosure (e. g. . a biased title in a newspaper report or a 
presentation of a case by a prosecutor) often influences the listener's ongoing per- 
ception of the information. 

5. One of the greatest hampers to security springs from negative perceptions of secur- 
ity requirements at a high level. Some of these represent functional s 

6. We can overcome learned helplessness and naiveté by that initial decisions 
do not shut off creative thinking. 

7. We can overcome traps by seeking inputs from diverse populations and 
forcing ourselves to try to assumptions. 


8. We can overcome functional fixation by looking for uses for our tools. as 
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well as alternative paths to achieve our goals. 
II. Translate the following paragraphs into Chinese. 


1. Many people think of security products such as vulnerability scanners and anti-vi- 
rus software as tools that increase the security of a system or organization. But if this is 
the only view you hold, you are suffering from functional fixation. Each of these technolo- 
gies can be very complex and consist of thousands of lines of code. Introducing them into 


an environment also introduces a strong possibility of new vulnerabilities and attack sur- 


A 
K 


2. Modern anti-virus software, pe sv: as been found to include all sorts of 


faces. 

















common programming vulnerabilities. s; cal buffer overflows. unchecked execution 


capabilities, and lack of authenticatjo} auto-update activities, This security software, 


therefore, can also become the, o) for attackers rath n the defense it was intended 


for. 2 d et 


"A 


3. Microsoft preferred for all their customers to upgrade to newer versions of Win- 























dows. of course. but did not dare to cut off customers using older versions or even retrofit 
them with the new hash function. Because the password was a key part of networking. 
they had to assume that. for the foreseeable future. old systems with no understanding of 
the new hash function would continue to connect to systems fitted out with the more secure 
hash. 
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Reading: Sunk Costs versus Future Profits: An Energy Example 


Part of my career has involved examining in detail the back end control systems at va- 
rious electric utilities and, to a somewhat lesser extent, oil company backend systems. I 
assessed how they were protected and traced their interconnections to other systems and 
networks. It was surprising how the oil and electric industries. while using such similar 


systems and protocols. could be operated and run in such widely disparate configurations 


and security postures. 
To put it politely, the electric company networks we drm control systems 


re 
and networks could be reached from the public Internet. e systems were be- 
ing shared by multiple tasks. interleaving word pr and other routine work with 
critical functions that should have been relegate: lized systems to prevent potential 
interference or disruption of operations. It appeaieu/n several cases that systems and net- 
works had been put together on a whim/And Without consideration of optimal or even accu- 
rate operations, Implementers move the next job as soon as things worked at all. 
Many plant control networks, p ormation news ad corporate LANs had no 
firewalls or chokepoints. F Dicey snr combined to create the poten- 
c 


tial for malicious interlope: 





o wreak serious ncluding manipulating or disrupting 


the physical con ed in the produgtion transmission of power. 

Convers thd few offshore oil systenss that I had looked at. while utilizing similar 
SCADA eS configured and operated in a different fashion, Plant control and in- 
formation networks were strictly segregated from the corporate LAN. Most critical sys- 
tems were set correctly to have their results and status handled by a librarian system that 
then pushed the information out in a diode fashion to higher analysis systems. Concise and 
efficient network diagrams resulted in crisp and clean implementations of SCADA and DCS 
systems in the physical world. including restriction of access that resulted in effective secu- 
rity. In many cases the components were custom systems designed and configured to per- 
form only specific functions. 

The contrast between the electric and oil organizations intrigued and worried me. As 
fate would have it, I was in the position to be able to call a meeting about this subject with 
some high ranking technical people from electric companies. oil companies. and govern- 
ment (think spook) agencies. 

The first salient aspect that surprised me from the meeting was that the people from 


the electric utilities and their electric utility oversight and clearinghouse organizations did 
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not refute my statements regarding the poor—or completely missing— security on their net- 
works and systems. This surprised me because the electric companies were publicly deny- 
ing that they had any cyber-system risk. In our meeting they pointed out some examples 
where security had been implemented correctly—but they acknowledged that these exam- 
ples were exceptions and not the norm. 

My second surprise came when the oil companies stated that they did not go about de- 
signing their systems from a security perspective at all. and that although security was im- 
portant, it was not the business driver for how things were configured. The primary driver 
was to have an edge against their direct competitors. If company A could make a critical 
component operate at 5/4 greater efficiency than company B. the increased operational ca- 
pacity or reduction in overhead rewarded company A over ti arge sums of money. 
Examples of how to increase such efficiency included: N 

(1) Forced separation and segregation of ds ent critical systems from in- 
curring added latency from being queried by ma: and reporting requests, 

(2) Utilizing special-purpose systems d si o accomplish specific tasks in place of 
general purpose nonoptimized systems. 

These efficiencies benefited secur well. The first created strong, clean, and en- 


forceable boundaries in UM stems, The V pce systems with smaller 
~ 
S 


surface areas to attack. 







nd system boundary X«L obvious effect, but the case of 


smaller surface M s ves a brief examigari magine that you have a general-purpose 
INCH nfiguration, The {ült configuration might have several services 


already config: 


Enforceable netwo: 





system in its 
and running, as well as many local daemons executing to assist user pro- 
cessing. This allows the system to be deployed in the largest number of settings with mini- 
mal reconfiguration required. The systems’ vendor prefers such systems with broad capa- 
bilities because they make installation easier. 

However, this doesn’t mean that the default setting is optimal for the majority of con- 
sumers, just that it is acceptable. In the default setting, each of the running services is an 
attack surface that may be exploited. Similarly. client applications may be compromised 
through malicious input from compromised or falsified servers. The more services and cli- 
ent applications that are running on the system, the greater the attack surface and the grea- 
ter the likelihood that the system can be remotely or locally compromised. 

Having a large attack surface is not a good thing. but the drawback of generality ex- 
amined by the oil companies was the systems' suboptimal performance. For each running 
program. which includes server services as well as local applications. the kernel and CPU 


devotes processing time. If there are many running applications. the system has to time- 
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slice among them. a kernel activity that in itself eats up resources. 

However. if there are few running applications. each one can have a greater number of 
CPU slices and achieve greater performance. A simple way to slim down the system is to 
remove superfluous services and applications and optimize the systems to run in the most 
stripped-down and dedicated fashion possible. Another way is to deploy systems dedicated 
to specific functions without even the capability of running unrelated routines. These tac- 
tics had been used by the oil companies in the offshore rigs I had examined in order to max- 
imize performance and thus profits. 

Why hadn't the electric utilities gone through the same exercise as the oil companies? 
At first, electric companies were regulated monopolies. Where these companies did not 
need to be competitive, they had no drive to design optimize dcn structured envi- 
ronments, 

One would be tempted to assume that deii posure of electric companies 
to a competitive environment would improve thej idi 
as oil companies) their security. ade 


panies were turned loose. so to speak. a 





cy and (following the same path 
site occurred. When the electric com- 
Zed they needed cost-cutting measures to be 
competitive, their first steps were to če workforce. They ended up assigning fewer 
people to maintain and work, ori same number oun remote systems (often 


operations rather than looking 


through remote access vedi » focusing on da Aa 
ahead to long-term ee usually a NG e for efficiency or security. 
il 





servation I made in the previous section 





The story MS mpanies confirmycrh 
一 
about the ISR Mosi] érganizations think uriy as a sunk cost. insofar as they think of 
it at all. Securpy“approached in this fashion will likely be inadequate or worse. If. howev- 
er, one focuses on optimizing and streamlining the functionality of the networks and sys- 
tems for specific business purposes, security can often be realized as a by-product. And 
once again, security professionals can further their cause by overcoming their functional 
fixation on security as a noble goal unto itself worth spending large sums on. and instead 


sometimes looking at sneaking security in as a fortuitous by-product. 


New words and Phrases 





backend n. 后 端 implementer n. 实施 者 
protocol n. (数据 传递 的 ) 协 议 chokepoint n. 阻塞 点 
configuration n. (计算 机 的 ) 配 置 malicious adj. 恶意 的 ， 有 敌意 的 ; BEM; 
disruption n. 4¢ 32. 瓦解 ; 破裂 ， 毁坏 ; 预谋 的 ; 存心 不 良 的 
中 断 havoc n. 大 破坏 ,浩劫 ; ERE. HERR 








offshore adj. 离开 海岸 的 ; 近海 的 ; 海外 falsify vr. Bek. uas 









































的 ， 国 外 的 superfluous adj. 过 多 的 ; 多 余 的 ; 不 必要 
utility n， 实 验 程序 ; 公用 事业 的 ; 奢侈 的 
segregate vt. WARF; 分 离 ; 隔离 stripped-down adj. 无 装饰 的 ; 简装 的 
diode n. ZARES tactic n. 策略 ， 战 略 
configure n， 配 置 ; 设 定 ; 使 成 形 ; 使 具 一 定 monopoly n. ÈW 垄断 者 ; 专卖 权 

形式 insofar adv. f£: 的 范围 ; 在 …… 情 况 下 
salient adj. 显著 的 ， 突 出 的 ; 重要 的 ， 主 optimize vt， 使 最 优化 ， 使 完善 

要 的 ; 跳跃 的 streamline vt. {E+ 做 成 流线型 ; 组 织 ; 
segregation n. 分离 ， 隔 离 使 合理 化 ; 使 简单 化 
latency n， 潜 伏 ; 潜在 因素 by-product n. ow 附带 产生 的 结果 ; 


non optimized adj. 非 优 化 的 E. 
enforceable adj. 可 实施 的 ; 强行 的 ; 可 强  fixatio: 定位 ; 定 影 


jay $e 直到 
reconfiguration n， 重 新 配置 ,再 组 和 ; 结 We ladj. 暗中 进行 的 


构 变 形 Q^ Ai a 偶然 的 ; 不 规则 的 
Exercises A yA% 


E wv 
I. Answer tonne questions. XE VÀ 


1. What's simple way to slim Ph the system? 
2. Why hadn't the electric utilities gone through the same exercise as the oil compa- 


nies? 
]. Translate the following sentences into Chinese. 


l. General-purpose systems were being shared by multiple tasks. interleaving word 
processing and other routine work with critical functions that should have been relegated to 


specialized systems to prevent potential interference or disruption of operations. 














2. The first salient aspect that surprised me from the meeting was that the people 


from the electric utilities and their electric utility oversight and clearinghouse organizations 
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did not refute my statements regarding the poor—or completely missing—security on their 


networks and systems. 














3. One would be tempted to assume that deregulation and exposure of electric compa- 
nies to a competitive environment would improve their efficiency and (following the same 


path as oil companies) their security. 

















An Information-theoretic aT uation of a Cd Encryption Schemes 
e 


Randomized encryp chniques. where, r Nitro ass is used for security enhance- 
ment. are conside focus on the cağ the encrypted data experiences noise. 
e.g..is NN er a noisy channel séifhin the encoding-encryption paradigm. where 
the data is fi a for error correction. before being encrypted for security. We as- 





sume that the ciphertext is subject to a corruption equivalent to its transmission through a 
binary symmetric channel with known probability of error. The enhanced security is based 
on a dedicated wire-tap channel coding that introduces extra randomness. combined with 
that of the communication channel noise. The encryption is based on a block-by-block mod- 
ulo 2 addition between an encoded message vector and a pseudorandom vector. The goal is 
to enhance the protection of the secret key employed in the encryption algorithm. Security 
evaluations of the model are performed employing an information-theoretic approach. As- 
suming both a passive and an active attacker. we show that there is a threshold before 
which the wire-tap encoder guarantees an information-theoretic security (during which the 
equivocation of the secret key is increased), and after which the uncertainty reduces. ente- 
ring a regime in which a computational security analysis is needed for estimating the com- 


plexity resistance against the secret key recovery. 
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Adaptive Self-embedding Scheme With Controlled Reconstruction Performance 


In this paper. we address the problem of adaptive self-embedding. where the recon- 
struction quality is controlled individually for different fragments of a digital image. We fo- 
cus on the impact of incorporating content adaptivity features on the restoration success 
conditions and the achievable reconstruction performance. We analyze the problem theoret- 
ically and validate the obtained results experimentally with a fully functional selfembedding 
scheme. Our analysis shows that introduction of multiple reconstruction profiles. even 
with significantly lower restoration fidelity. does not need to improve the achievable tam- 
pering rate bounds. The obtained fine-grained control over the reconstruction process is ex- 


ploited to provide guarantees on certain performance aspects. n the derived theoret- 





ical model, we propose a procedure for optimization of X rall reconstruction quality 


given constraints on the desired target tampering rat 


lected image fragments. Such guarantees are of i 
cations. " x; 
Design and Performance Analysi M Ring Architecture for Smart Grid Privacy 


The traditional ace has become wa Ean the needs and demands 


of electricity users in th«21* century. To add 


required quality level for se- 


importance in a number of appli- 





s challenge, smart grid technologies 





have emerged. w] ise more efficieng pr tion and usage of electricity through bi- 
directional i ctiohà between the cons and the utility provider. This two-way inter- 
action allows eJectricity to be generated in real time based on the actual needs of the con- 
sumers, However. this two-way interaction also raises concerns related to the privacy and 
the personal habits of consumers, To protect sensitive energy usage information of con- 
sumers, we propose a virtual ring architecture that can provide a privacy protection solu- 
tion using symmetric or asymmetric encryptions of customers’ requests belonging to the 
same group. We compare the efficiency of our proposed approach with two recently pro- 
posed smart grid privacy approaches namely. one based on blind signature and other based 
on a homomorphic encryption solution. We show that our approach maintains the privacy 
of customers while reducing the performance overhead of cryptographic computations by 
more than a factor of 2 when compared with the aforementioned past solutions. We further 
demonstrate that our smart grid privacy solution is simple. scalable. cost-effective. and in- 


curs minimal computational processing overheads. 





unit Z 


COMPUTER SCIENCE 


Text: Ethical Issues for ON cientists 


starting point for any discussion of pro- 


ture. This observation, now commonplage 
fessionalism and ethics in computing. aut 

The work of computer scien: engineers is p: the social. political. econom- 
ic, and cultural world in aps ive. and it affects eon of that world. Profes- 


sionals who work with térs have special EON e. That knowledge. when com- 


bined with comput 
technical systems; p 


In this p NOU. provide a perspectivelon the role of computer and engineering profes- 


Computers have had a powerful impact SN Ñ and are destined to shape our fu- 






s significant powe; T people's lives—by changing socio- 





al, political and eif ić institutions; and social relationships. 


sionals and we examine the relationships and responsibilities that go with having and using 
computing expertise. In addition to the topic of professional ethics. we briefly discuss sev- 


eral of the social-ethical issues created or exacerbated by the increasing power of computers 








and information technology: privacy. property. risk and reliability. and globalization. 
Computers. digital data. and telecommunications have changed work. travel. educa- 
tion, business. entertainment. government. and manufacturing. For example. work now 
increasingly involves sitting in front of a computer screen and using a keyboard to make 
things happen in a manufacturing process or to keep track of records. In the past. these 
same tasks would have involved physically lifting. pushing. and twisting or using pens. 
paper. and file cabinets. Changes such as these in the way we do things have. in turn. fun- 
damentally changed who we are as individuals. communities. and nations. Some would ar- 
gue, for example. that new kinds of communities (e. g. . cyberspace on the Internet) are 


forming. individuals are developing new types of personal identities. and new forms of au- 








thority and control are taking hold as a result of this evolving technology. 

Computer technology is shaped by social-cultural concepts. laws. the economy. and 
politics. These same concepts. laws. and institutions have been pressured. challenged. 
and modified by computer technology. Technological advances can antiquate laws. con- 
cepts, and traditions, compelling us to reinterpret and create new laws, concepts, and 
moral notions. Our attitudes about work and play. our values. and our laws and customs 
are deeply involved in technological change. 

When it comes to the social-ethical issues surrounding computers. some have argued 


that the issues are not unique. All of the ethical issues raised by computer technology can. 





it is said. be classified and worked out using traditional moral concepts. distinctions. and 


sing traditional moral concepts. such as privacy. propert responsibility. and tradi- 


tional moral values, such as individual freedom, "xus countability, and communi- 
ty. These concepts and values predate computers; it would seem there is nothing u- 


nique about computer ethics. 


theories. There is nothing new here in the sense that we & nd the new issues u- 


~ 

On the other hand. those who arguk fo uniqueness of the issues point to the fun- 
damental ways in which computers ha ahged so many human activities. such as manu- 
facturing. record keeping. bank ternational trad cation. and communication. 


Taken together. these cha áre so radical. it,is + that traditional moral con- 








cepts, distinctions. an ories. if not abando ; st be significantly reinterpreted and 


extended. For exa » they must be exteyidedytb^computer-mediated relationships. com- 
— 
puter softwa: ter art, dataminiil iral systems, and so on. 
The uniqu of the ethical issues surrounding computers can be argued in a variety 


of ways. Computer technology makes possible a scale of activities not possible before. This 





includes a larger scale of record keeping of personal information. as well as larger-scale cal- 
culations which. in turn. allow us to build and do things not possible before. such as un- 
dertaking space travel and operating a global communication system. Among other things. 
the increased scale means finer-grained personal information collection and more precise da- 
ta matching and datamining. In addition to scale. computer technology has involved the 
creation of new kinds of entities for which no rules initially existed; entities such as com- 
puter files. computer programs. the Internet, Web browsers. cookies. and so on. The u- 
niqueness argument can also be made in terms of the power and pervasiveness of computer 
technology. Computers and information technology seem to be bringing about a magnitude 
of change comparable to that which took place during the Industrial Revolution. transfor- 
ming our social. economic. and political institutions; our understanding of what it means 


to be human; and the distribution of power in the world. Hence. it would seem that the is- 











sues are at least special. if not unique. 

In this paper we will take an approach that synthesizes these two views of computer 
ethies by assuming that the analysis of computer ethical issues involves both working on 
something new and drawing on something old. We will view issues in computer ethics as 
new species of older ethical problems. such that the issues can be understood using tradi- 
tional moral concepts such as autonomy. privacy. property. and responsibility. while at 
the same time recognizing that these concepts may have to be extended to what is new and 
special about computers and the situations they create. 

Most ethical issues arising around computers occur in contexts in which there are al- 


ready social, ethical, and legal norms. In these contexts. often there are implicit, if not 


e issues are not new or u- 





formal (legal), rules about how individuals are to behave; tl amiliar practices. so- 
cial meanings, interdependencies, and so on. In this res; S 

nique, or at least cannot be resolved without unders the prevailing context. mean- 
ings. and values, At the same time, the situati ve special features because of the 
involvement of computers—features that "SY b 


These features can make a moral differerige. 
uJ 


een addressed by prevailing norms. 

example. although property rights and e- 

ven intellectual property rights a m worked out long before the creation of soft- 

ware. when software first appea ibraised a new per issue, Should the ar- 

rangement of icons appeari E the screen of a AE ace be ownable? Is there any- 
e 


thing intrinsically wron, 





copying software? § as features that make the distinc- 


tion between ies Mer ession (a distination e core of copyright law) almost inco- 
e 
herent. I X 

As m s has features that ike standard intellectual property laws difficult 
to enforce. Hence, questions about what should be owned when it comes to software and 
how to evaluate violations of software ownership rights are not new in the sense that they 
are property rights issues. but they are new in the sense that nothing with the characteris- 
tics of software had been addressed before. We have. then. a new species of traditional 
property rights. 

Similarly. although our understanding of rights and responsibilities in the employer- 
employee relationship has been evolving for centuries. never before have employers had the 
capacity to monitor their workers electronically. keeping track of every keystroke. and re- 
cording and reviewing all work done by an employee (covertly or with prior consent). 
When we evaluate this new monitoring capability and ask whether employers should use it. 
we are working on an issue that has never arisen before. although many other issues invol- 
ving employer-employee rights have. We must address a new species of the tension be- 


tween employer-employee rights and interests. 








The social-ethical issues posed by computer technology are significant in their own 
right. but they are of special interest here because computer and engineering professionals 
bear responsibility for this technology. It is of critical importance that they understand the 
social change brought about by their work and the difficult social-ethical issues posed. Just 
as some have argued that the social-ethical issues posed by computer technology are not u- 
nique. some have argued that the issues of professional ethics surrounding computers are 
not unique. We propose. in parallel with our previous genus-species account. that the pro- 
fessional ethics issues arising for computer scientists and engineers are species of generic is- 
sues of professional ethics. All professionals have responsibilities to their employers. cli- 


challenge in all professions. Moreover. all professionals be responsibility for the 







ents, co-professionals, and the public. Managing these types [orsa poses a 


impact of their work. In this sense, the professional ethi s arising for computer sci- 


entists and engineers are generally similar to those in fessions. Nevertheless. it is 


also true to say that the issues arise in unique f 
because of the special features of computer, X 


New Words and Phrases "s 
destine vt. 注定 Spei 广大 ; EK, 重要 ; 量 级 


omputer scientists and engineers 

















socio-technical adj. #4 vx 流行 ， 盛 行 ; 获胜 ， 占 优势 ; 说 

exacerbate vt. 使 ， 劝 说 

file cabinet 7 iJ Pa adv. 从 本 质 上 ( 讲 ) 

cyberspace ní 子 计算 机 创造 的 ) 通 信 、 incoherent adj. 不 连贯 的 ; PARN; 无 
信息 空间 STER 

antiquate ut。 使 变 成 废弃 物 keystroke n， 击 键 ; 按键 

predate vt. 提早 日 期 fs vi. 用 键盘 输入 ; 击 打 …… 

reinterpret vt. 重新 解释 covertly adv. 偷偷 摸 摸 地 ; 秘密 地 

data mining 数据 挖掘 consent vi. 同意 ; 赞成 ; 答应 

entity n. 实体 ; 实际 存在 物 ; 本 质 n. 同意 ;意见 等 的 ) 一 致 ， 赞 成 

pervasiveness n. 无 处 不 在 ， 遍布 genus-species n. fif 

Notes 


l. intellectual property rights: 知识 产权 ， 指 权利 人 对 其 所 创作 的 智力 劳动 成 果 所 享 
有 的 专 有 权利 。 








Exercises 


I. Please translate the following words and phrases into Chinese. 


l. evolving technology 
professional ethics 

digital data 

personal identities 

virtual system 

global communication system 
data matching 


copyright law 


和 


monitoring capability 


I. Fill in the blanks with the missi 






























sequence NS concepts 

consequences ed analysis 
trade-off mifimized policy 
optimum m. classified IT queue 

ethical J dynamic y policies elements 











1. All of the ethical issues raised by computer technology can, it is said, be 


and worked out using traditional moral 


2. We can obtain a different performance 


» distinctions, and theories. 


by means of an array. which allows constant-time 


sequence given its position. 


3. Realizing a 


take OClog N) time. where N is the number of 


time the operation is performed. 


4. Our aim is not to propose. defend. or attack any particular 


er. we offer brief descriptions of three major and 


nature of ethical 


5. Utilitarianism has greatly influenced 20*-century thinking. especially insofar as it 


influenced the development of cost-benefit analysis. According to 





in the priority 


by implementing the sorted 


to any element of the 


queue with a heap has the advantage that all of the operations 


theory. Rath- 


ethical theories to illustrate the 


+ we should 








make decisions about what to do by focusing on the of actions and policies; we 
should choose actions and that bring about the best consequences. 

6. The emphasis on consequences found in utilitarianism is very much a part of per- 
sonal and policy decision making in our society. in particular as a for law and 
public . Cost-benefit and risk-benefit analysis are. for example, consequentialist 
in character. 

7. Optimization problems always have an function to be or maxi- 
mized, but it is not often clear what steps to take to reach the value. For exam- 


ple. in the optimum binary search tree problem of the previous section. we used 
programming to systematically examine all possible trees. 

II. Translate the following sentences into Chinese. «^ à 

l. Although usability test subjects normally es c 3) bodily harm—even from i- 
rate developers resenting the users’ mistreatme hele’ beloved software—test participa- 


tion can still be quite distressing. Users feel Cee dous pressure to perform, even when 


not the user. 
ae moral claims and commitments. It is 


not just a PAS iculating intuitio Ahen the reasons given for a claim are devel- 


oped into a mo eory, the theory can be incorporated into techniques for improved tech- 


told the study's purpose is to test the syStei 


2. Ethical a is involves giving ri 
























nical decision making. The three ethical theories described in this section represent three 
traditions in ethical analysis and problem solving. The account we give is not exhaustive. 
nor is our description of the three theories any more than a brief introduction. The three 


traditions are utilitarianism. deontology. and social contract theory. 
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Reading: Ethical Issues That Arise from Computer Technology 


The effects of a new technology on society can draw attention to an old issue and can 





change our understanding of that issue. The issues listed in this section—privacy, property 
rights. risk and reliability. and global communication—were of concern. even problematic. 
before computers were an important technology. 

But computing and, more generally. electronic telecommunications, have added new- 
twists and new intensity to each of these issues. Although computer professionals cannot 
be expected to be experts on all of these issues. it is AS t m to understand that 
computer technology is shaping the world. 


And it is important for them to keep these mesa S as they work with comput- 






er technology. Those who are aware of privacy iss syst are more likely to take 


those issues into account when they RM management systems; those who are 
aware of risk and reliability issues are RS to articulate these issues to clients and 


attend to them in design and documen 


Privacy X 


Privacy is a centraL'*ópic in E. ethi € have even suggested that privacy is 
iquated by tech: Ka 
openness. RNS that computers Be harnessed to help restore as much privacy 


as possible to 






a notion that has a that it should be replaced by a new 
Society. Although they may not like it, computer professionals are at the 
center of this controversy. Some are designers of the systems that facilitate information 
gathering and manipulation; others maintain and protect the information. As the saying 
goes, information is power —but power can be used or abused. 

Computer technology creates wide-ranging possibilities for tracking and monitoring of 
human behavior. Consider just two ways in which personal privacy may be affected by 
computer technology. First. because of the capacity of computers. massive amounts of in- 
formation can be gathered by record-keeping organizations such as banks, insurance com- 
panies. government agencies. and educational institutions. The information gathered can 
be kept and used indefinitely. and shared with other organizations rapidly and frequently. 
A second way in which computers have enhanced the possibilities for monitoring and track- 
ing of individuals is by making possible new kinds of information. When activities are done 
using a computer. transactional information is created. When individuals use automated 


bank teller machines. records are created; when certain software is operating. keystrokes 











on a computer keyboard are recorded; the content and destination of electronic mail can be 
tracked. and so on. With the assistance of newer technologies. much more of this transac- 
tional information is likely to be created. For example. television advertisers may be able 
to monitor television watchers with scanning devices that record who is sitting in a room fa- 
cing the television. Highway systems allow drivers to pass through toll booths without 
stopping as a beam reading a bar code on the automobile charges the toll. simultaneously 
creating a record of individual travel patterns. All of this information (transactional and 
otherwise) can be brought together to create a detailed portrait of a person's life, a portrait 
that the individual may never see. although it is used by others to make decisions about the 
individual. 


However. is it computer technology per se that poses t or is it just the way 





e technology has been used (and is likely to be used in t re)? Computer profession- 


Is might argue that they create the technology but t hesponsible for how it is used. 


2 


This argument is, however. problematic fo: r of reasons and perhaps foremost 







because it fails to recognize the potential SR some of the problems of abuse in the 


design of the technology. Computer profess s are in the ideal position to think about 


the potential problems with ne. rs © design so as to avoid these problems. When, 
instead of deflecting eoniceris | rivacy as out of th ryiew. computer professionals 


set their minds to solve pride and security prohl 
prove. 


At the ame Re think about chagigin) mputer technology. we also must ask 
ab 





stems they design can im- 





deeper ques privacy itself and is that individuals need. want. or are enti- 





tled to when they”express concerns about the loss of privacy. In this sense. computers and 
privacy issues are ethical issues. They compel us to ask deep questions about what makes 
for a good and just society. Should individuals have more choice about who has what infor- 
mation about them? What is the proper relationship between citizens and government. be- 
tween individuals and private corporations? How are we to negotiate the tension between 
the competing needs for privacy and security? As previously suggested. the questions are 
not completely new. but some of the possibilities created by computers are new, and these 
possibilities do not readily fit the concepts and frameworks used in the past. Although we 
cannot expect computer professionals to be experts on the philosophical and political analy- 


sis of privacy. it seems clear that the more they know. the better the computer technology 





they produce is likely to be. 
Property Rights and Computing 


The protection of intellectual property rights has become an active legal and ethical de- 
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bate. involving national and international players. Should software be copyrighted. paten- 
ted. or free? Is computer software a process. a creative work. a mathematical formalism. 
an idea. or some combination of these? What is society's stake in protecting software 
rights? What is society's stake in widely disseminating software? How do corporations and 
other institutions protect their rights to ideas developed by individuals? And what are the 
individuals’ rights? Such questions must be answered publicly through legislation, through 
corporate policies. and with the advice of computing professionals. Some of the answers 
will involve technical details. and all should be informed by ethical analysis and debate. 

An issue that has received a great deal of legal and public attention is the ownership of 
software. In the course of history, software is a relatively new entity. Whereas Western 
legal systems have developed property laws that encourage j by granting certain 
rights to inventors. there are provisions against obe that might interfere 
with the development of the technological arts and s ce& For this reason. copyrights 
protect only the expression of ideas, not the ideas Selves, and we do not grant patents 
on laws of nature, mathematical Muro S. 


d grant ownership of it without, in effect. 


ct ideas. The problem with computer 
software is that it has not been clear that we 


granting ownership of numerical sequ Or mental steps. Software can be copyrighted. 





because a copyright gives the ho! ownership of the e ion of the idea (not the idea 
itself), but this does not gi oftware inventors as à otection as they need to com- 
pete fairly. Competitors* nay, see the software A e idea. and write a somewhat dif- 
ferent program Ale ame thing. Theyom| or can sell the software at less cost be- 
cause the cos deyeloping the first so does not have to be paid. Patenting would 
provide BPN rotection, but until quite recently the courts have been reluctant to grant 
this protection because of the problem previously mentioned; patents on software would 
appear to give the holder control of the building blocks of the technology, an ownership 
comparable to owning ideas themselves. In other words, too many patents may interfere 
with technological development. 

Like the questions surrounding privacy, property rights in computer software also lead 
back to broader ethical and philosophical questions about what constitutes a just society. In 
computing, as in other areas of technology. we want a system of property rights that pro- 
motes invention (creativity. progress). but at the same time, we want a system that is fair 
in the sense that it rewards those who make significant contributions but does not give any- 
one so much control that others are prevented from creating. Policies with regard to prop- 
erty rights in computer software cannot be made without an understanding of the technolo- 
gy. This is why it is so important for computer professionals to be involved in public dis- 


cussion and policy setting on this topic. 








Risk, Reliability, and Accountability 


As computer technology becomes more important to the way we live. its risks become 
more worrisome. System errors can lead to physical danger. sometimes catastrophic in 
scale. There are security risks due to hackers and crackers. Unreliable data and intentional 
misinformation are risks that are increased because of the technical and economic character- 
istics of digital data. Furthermore. the use of computer programs is. in a practical sense. 
inherently unreliable. Each of these issues (and many more) requires computer profession- 
als to face the linked problems of risk. reliability. and accountability. Professionals must 


be candid about the risks of a particular application or system. Computing professionals 


should take the lead in educating customers and the public at predictions we can 
and cannot make about software and hardware reliability: uter professionals should 
make realistic assessments about costs and benefits. e willing to take on both for pro- 


jects in which they are involved. 
There are also issues of sharing risks pM resources, Should liability fall to the 
individual who buys software or to the/tor, ion that developed it? Should society ac- 


knowledge the inherent risks in usi y evolving globally networked telecommunica- 


tions? X 
The system of comput id connections ki Internet provides the infra- 
structure for new kinds?$f communities—elect«oni mmunities. Questions of individual 
accountability and ntrol. as well asa 
一 
nities，as in NS s. It is not just the have societies forming in a new physical en- 
vironment; it 


i 
so that ongoing electronic communication changes the way individuals 





of etiquette. arise in electronic commu- 


understand their identity, their values. and their plans for their lives. The changes that are 
taking place must be examined and understood, especially the changes affecting fundamen- 
tal social values such as democracy, community. freedom. and peace. 


Of course, speculating about the Internet is now a popular pastime. and it is 





mportant 
to separate the hype from the reality. The reality is generally much more complex and 
much more subtle. We will not engage in speculation and prediction about the future. 
Rather. we want to emphasize how much better off the world would be if (instead of watc- 
hing social impacts of computer technology after the fact) computer engineers and scien- 
tists were thinking about the potential effects early in the design process. Of course. this 
can only happen if computer scientists and engineers are encouraged to see the social-ethical 


issues as a component of their professional responsibility. 
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New Words and Phrases 

















twist vt. 48; 拧 使 苦恼 provision n. 规定 ; 条 款 ; 准备 ; 供应 品 
n. 扭曲; fr; 扭伤 vt. 供给 …… 食 物 及 必需 品 
articulate wt。 清 晰 地 发 ( 音 ); 明确 有 力 地 ”patent n， 专 利 
表达 ; 用 关节 连接 ;使 相互 连贯 YL， 获得 …… 专 利 
documentation n. 文件 编制 文件 的 提供 ; catastrophic adj. 灾难 的 ; BMW. ARES 
文档 编制 局 的 





harness ul. 利用; 控制 
controversy n. 争论 ; 争议; 辩论 





facilitate ut. 促进 ; 使 容易 inherently 

toll booths 收费 亭 infrast ren， 基 础 设施 

philosophical adj. 哲学 上 的 omgoihg Adj. 不 间断 的 ， 进 行 的 ;前进 的 
stake n, 桩 ， 赌 注 ， 奖 人 金 n. 前 进 ; 行为 ,举止 





vt. 资助 支持， 把 …… 押 下 打 赂 sR n. 推断 ; 沉思 


legislation n. sriks 法律 的 制定 ; 法 ] 
Exercises JU yA% 


X 
I. Answer hoe questions. E 
I 
de 


MES. ethical issues that arise from computer technology? 
2. Why are computers and privacy issues ethical issues? 


3. What risks might computer technology bring about? 
]. Translate the following sentences into Chinese. 


l. The effects of a new technology on society can draw attention to an old issue and 
can change our understanding of that issue. The issues listed in this section— privacy. 
property rights. risk and reliability. and global communication—were of concern. even 


problematic, before computers were an important technology. 














2. Those who are aware of privacy issues. for example. are more likely to take those 











issues into account when they design database management systems; those who are aware 
of risk and reliability issues are more likely to articulate these issues to clients and attend to 


them in design and documentation. 














3. Although we cannot expect computer professionals to be experts on the philosophi- 
cal and political analysis of privacy. it seems clear that the more they know. the better the 


computer technology they produce is likely to be. 

















Automatic Generation of SM Synthetic r Target Applications to Effi- 
ciently Design Multicor: rs 


Prohibitive iie time with pr NA models and unavailability of propri- 
etary target lots make mion AE design very tedious. The framework pro- 
posed in this paper is the first attempt to automatically generate synthetic benchmark prox- 
ies for real world multithreaded applications. The framework includes metrics that charac- 
terize the behavior of the workloads in the shared caches. coherence logic. out-of-order 
cores, interconnection network and DRAM. The framework is evaluated by generating 
proxies for the workloads in the multithreaded PARSEC benchmark suite and validating 
their fidelity by comparing the microarchitecture dependent and independent metrics to that 
of the original workloads. The average error in IPC is 4. 87 percent and maximum error is 
10. 8 percent for Raytrace in comparison to the original workloads. The average error in 
the power-per-cycle metric is 2. 73 percent with a maximum of 5. 5 percent when compared 
to original workloads. The representativeness of the proxies to that of the original work- 
loads in terms of their sensitivity to design changes is evaluated by finding the correlation 
coefficient between the trends followed by the synthetic and the original for design changes 
in IPC, which is 0.92. A speedup of four to six orders of magnitude is achieved by using 


the synthetic proxies over the original workloads. 
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Coordinating Garbage Collection for Arrays of Solid-state Drives 


Although solid-state drives (SSDs) offer significant performance improvements over 
hard disk drives (HDDs) for a number of workloads, they can exhibit substantial variance 
in request latency and throughput as a result of garbage collection (GC). When GC con- 
flicts with an I/O stream. the stream can make no forward progress until the GC cycle 
completes. GC cycles are scheduled by logic internal to the SSD based on several factors 
such as the pattern, frequency. and volume of write requests. When SSDs are used in a 
RAID with currently available technology. the lack of coordination of the SSD-local GC cy- 
cles amplifies this performance variance. We propose a global garbage collection (GGC) 
mechanism to improve response times and reduce performance 
Ds. We include a high-level design of SSD-aware RAID 


devices and algorithms to coordinate the Gi 











c ier eXelop reactive and proactive GC 


Lo n block erase counts for vari- 


T 


coordination algorithms and evaluate their ow Pg 






ous workloads. Our simulations show aee nation by a reactive scheme improves 
average response time and reduces perfo; VK a S PRÉ for a wide variety of enterprise 
workloads. For bursty, write-domin: di ae response time was improved by 69 
percent and performance i s reduced by 71 nt. We show that a proactive 
GC coordination algorithm see r improve the I as times by up to 9 percent 
and the performance vanabilit$ by up to 15 percen also observe that it could increase 
the lifetimes of SS ine workloads, € Financial) by reducing the number of 
block erase oe up to 79 percent o a reactive algorithm for write-dominant 
[i 


enterprise wi 


Optimal Channel Estimation in Beamformed Systems for Common-randomness-based Se- 
cret Key Establishment 


Establishing secret keys from the commonly-observed randomness of reciprocal wire- 
less propagation channels has recently received considerable attention. We consider such 
key establishment between two multiantenna nodes that use beamforming for communica- 
tion. showing that the upper bound on the number of key bits that can be generated from 
the channel observations can be maximized by properly probing the channel. Specifically. 
we demonstrate that the eigenvectors of the channel spatial covariance matrix should be 
used as beamformer weights during channel estimation and we optimize the energy alloca- 
ted to channel estimation for each beamformer weight under a total energy constraint. Fi- 
nally. by assuming that the channel covariance is separable. we illustrate implementation 
of the technique for practical beamforming systems and more advanced signal models that 


incorporate antenna mutual coupling. 





unit 8 


CRYPTOGRAPHY 


Text: Public-key Infrast res 


1. Personal Security Environments / SS 


Importance 小 S XM 
Re or decry| Xmen using a public-key system, 





Bob must keep. thi secret because everybody who knows 





: — NAA 5 
the key can NO SSages in Bob's nami “décrypt secret documents that were sent to 
Bob. Therefor 


keys are securely stored. Since the private keys should not leave the PSE. it also does the 


ob needs a personal sedurity environment (PSE) in which his private 


signing or decrypting. 
Frequently. the PSE also generates the private keys. If the private keys are generated 


elsewhere, then at least the generating institution knows Bob’s secret keys. which maycor- 





rupt the security of the system. On the other hand, secure key generation may require re- 
sources not present in the PSE. For example. for RSA"! keys random primes of a fixed bit 
length are required. In particular. the key generating environment must generate large. 
cryptographically secure, random numbers. If the random number generator of the PSE is 
weak, then the public-key system is insecure. It may therefore make sense to have the 


RSA keys generated by a trusted institution. 
Implementation 


The more sensitive the documents that are signed or encrypted. the more secure the 
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PSE must be. A simple PSE is a file in Bob's home directory that can be accessed only after 
entering a secret password. 

This password may. for example. be used to decrypt the information. The security of 
a software PSE relies on the security of the underlying operating system. One may argue 
that operating systems must be very secure anyway and that they are therefore able to pro- 
tect the PSE. Operating systems. for example. prevent unauthorized users from becoming 
administrators. On the other hand. it is well known that with sufficient effort the security 
of most operating systems can be successfully attacked. Therefore. a software PSE is not 
adequate for applications that require high security. 


It is more secure to put the PSE on a smart card. Bob can cayry his smart card in his 





wallet. If the card is in the smart card reader, it only permits,v ited access. Manipu- 
lating its hardware or software is very difficult XM ssful attacks have been re- 
ported). 

Unfortunately. computations on smart car «^ slow. Therefore. it is im- 
possible to decrypt large documents on a En + so public keys encrypt session keys 
which, in turn, are used to encrypt the SC The encrypted session key is appended 
to the encrypted document. NA € decrypts the session key. The decryption 
of the document is then E > or mh X 


Representation Prohle! 


qe 


Even if Aloe Ws mart card for NT is still a severe security problem. If 
Alice wants gn fedocument, she sta rogram on her PC, which sends the docu- 
ment or its hash, Value to the smart card. where it is signed. With some effort. the attack- 
er, Oscar, can manipulate the signing program on Alice’s PC such that it sends a document 
to the smart card that is different from the one that Alice intended to sign. Because the 
smart card has no display. Alice is unable to detect this fraud. It is therefore possible that 
Alice could sign documents that she never wanted to sign. This problem is called the repre- 
sentation problem for signatures. The more important documents are for which digital sig- 
natures are accepted, the more dramatic the representation problem becomes. The problem 
is solved if Alice sees what she signs. For this purpose, Alice’s PSE needs a display. One 
possibility is to use a cellular phone as a PSE. But its display is very small. Hence, the 
documents that can be signed securely on it are rather short. It depends on the solution of 
the representation problem whether digital signatures can be used to replace handwritten 


signatures, 











2. Certification Authorities 


If Alice uses a public-key system. it is not sufficient for her to keep her own private 
keys secret. If she uses the public key of Bob. she must be sure that it is really Bob's key. 
If the attacker, Oscar. is able to substitute his own public key for Bob's public key. then 
Oscar can decrypt secret messages to Bob and he can sign documents in Bob's name. 

One solution of this problem is to establish trusted authorities. Each user is associated 
with such a certification authority CA). The user trusts his CA, With its signature, the 


eir CA, We now ex- 


CA certifies the correctness and validity of the public keys of itspusers. The users know 
their CA's public key. Therefore. they can verify the signat A» 


plain in more detail what a CA does. 


Registration Qs 


If Bob becomes a new user of the PRESA em, then he is registered by his CA. 
Re Sonal data. The CA verifies Bob's informa- 


im person and present some identification. The 


He tells the CA his name and other rele 






tion. Bob can. for example. go to 


CA issues a user name for Boh t 


ifferent from soil ene of all other users in the 
system. Bob ill use this nai Abr example. if iol iments. If Bob wants to keep 
n 


his name secret, then 220 use a pseudon + only the CA knows Bob's real 
name. 又 x& V 
~、 IA 

Key Geni ! YS. 


Bob's public and private keys are generated either in his PSE or by his CA. It is rec- 
ommended that Bob not know his private keys. because then he cannot inform others about 
those keys. The private keys are stored in Bob's PSE. The public keys are stored in a di- 
rectory of the CA. Clearly. the keys must be protected while they are communicated be- 
tween Bob and his CA. 

For each purpose (for example. signing. encryption, and identification), a separate 
key pair is required. Otherwise. the system may become insecure. This is illustrated in the 


next example. 
Certification 


The CA generates a certificate. which establishes a verifiable connection between Bob 
and his public keys. This certificate is a string. which is signed by the CA and contains at 


least the following information. 
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. The user name or the pseudonym of Bob. 

. Bob's public keys. 

. The names of the algorithms in which the public keys are used. 
. The serial number of the certificate. 

. The beginning and end of the validity of the certificate. 

. The name of the CA. 


. Restrictions that apply to the use of the certificate. 


1 
2 
3 
4 
5 
6 
7 


The certificate is stored. together with the user name. in a directory. Only the CA is 


allowed to write in this directory. but all users of the CA can read the information in the 


directory. 


New Words and Phrases gS 
cryptography n， 密 码 学 Su. 正确 ,正确 性 
decrypt vt. 解码 Xm cation n， 识 别 试验 ， 结 构 鉴定 
corrupt adj. 错误 百出 的 "Ki eudonym n. 假名， 化 名 

vt. & vi. 损坏 certification n. EB]. "Eze. WEP 


prime adj. 素数 的 E, S directory 


dot 
append vt， 附 加 ， 添 加 e :指导 的 ， 咨 询 的 ， 管 理 的 


fraud n. KYE, ie a 
Bg 
Notes N 


1. RSA: RSA 公 钥 加 密 算法 是 1977 年 由 罗 纳 德 。 李维斯 特 (Ron Rives), piii + gë 
莫 尔 (Adi Shamir) 和 伦 纳 德 。 阿 德 最 (Leonard Adleman) 一 起 提出 的 。 当 时 他 们 三 人 都 在 
麻 省 理工 学 院 工 作 。RSA 就 是 他 们 三 人 姓氏 开头 字母 拼 在 一 起 组 成 的 。RSA 是 目前 最 有 
影响 力 的 公 钥 加 密 算法 ， 它 能 够 抵抗 目前 为 止 已 知 的 绝 大 多 数 密码 攻击 ， 已 被 ISO 推荐 为 
公 钥 数据 加 密 标 准 。 只 有 短 的 RSA 钥匙 才 可 能 被 强力 方式 解 破 。 到 2008 年 为 止 ， 世 界 上 
还 没有 任何 可 靠 的 攻击 RSA 算法 的 方式 。 只 要 其 钥匙 的 长 度 足够 长 ， 用 RSA 加 密 的 信息 
实际 上 是 不 能 被 解 破 的 。 但 在 分 布 式 计算 和 量子 计算 机 理论 日 趋 成 熟 的 今天 ，RSA 加 密 
安全 性 受到 了 挑战 。RSA 算法 基于 一 个 十 分 简单 的 数论 事实 : 将 两 个 大 素数 相 乘 十 分 容 
易 ， 但 想 要 对 其 乘积 进行 因 式 分 解 却 极其 困难 ， 因 此 可 以 将 乘积 公开 作为 加 密 密 钥 。 









































Exercises 


I. Please translate the following words and phrases into Chinese. 


private key 

personal security environment 
key generation 

public-key system 

smart card reader 

session key 


encrypted session 





hash value 


SOP O° gum Qv Sy dee pu. op fp 


certification authority 


I. Fill in the blanks with the missi 









certificates authen exchanged 





verified 


insecure dissi 
distribution placed 
[m "mn 


validity 





encryption 
























keys in asymmetric 


need not be kept secret, key in 


those systems is simpler than in symmetric schemes. Private keys. however. must be kept 
. Also. public must be protected from falsification and abuse. There- 

fore. appropriate public-key infrastructures must be set up. They are responsible for key 
and management, 


2. Depending on their use, keys in systems must be stored even after they 


expire. Public 


keys must be 


keys must be stored as long as signatures generated with those 


. The CA stores 


for public signature keys. 


3. Authentication keys. private signature keys. and public 


put in archives, They must be stored only as long as they are used for 


keys need not be 


+ genera- 


ting signatures. or encrypting documents. 
4. The CA maintains a of all certificates together with the name of the own- 
er of each certificate. 


5. All keys in a public-key system have a certain period of . Before a key ex- 








pires, it must be by a new, valid key. This new key is between the CA 
and the users in such a way that it does not become even if the old, invalid key 


becomes known. 
I. Translate the following paragraphs into Chinese. 


1. In public-key infrastructures it is frequently useful to be able to reconstruct private 
keys. For example. if a user has lost his smart card that contains his private decryption 
key, then he cannot decrypt any encrypted file on his computer anymore. So those encrypt- 
ed files are then inaccessible for the user unless it is possible to reconstruct the decryption 


key. However, for security reasons it may be important that the key cannot be reconstruc- 





ted by a single person. That person could abuse the knowl. he private key. It is 





























2. The soni s —: based on the intractability of certain 
bi 


computation Now s. The security o A and Rabin schemes is based on the hard- 
ness of integer orization. The security of the ElGamal protocols and of DSA is based on 


the intractability of computing discrete logarithms in finite prime fields. However. none of 





those computational problems is provably intractable. Algorithmic progress has almost al- 
ways been faster than predicted and it is known that quantum computers will make integer 


factorization and discrete logarithm computation in the relevant groups easy. 
































UNIT 8 CRYPTOGRAPHY 








Reading: Passwords 


Access to UNIX or Windows NT systems is typically controlled by password systems. 
Each user picks his individual and secret password w. The computer stores the image f 
(w) of the password vw under a one-way function f. If the user wants access to his comput- 
er, he enters his name and password w. The computer determines f(x) and compares this 
value with the stored value. If they are identical, then access is granted. Otherwise, the 
user is rejected. 

Passwords are also used to control access to World Wi ages or to files that 
contain private encryption or signature keys. K 

The password file does not need to be kept sec: nce it contains only the images 
[f Cw). of the passwords w and f is a one-way fugctián. Nevertheless. password identifica- 


tion systems are not very secure. - 






A user must memorize his or her pass’ . Therefore. many users choose the first 


name of their spouse or children as thi Ssword, An attacker can mount a dictionary at- 





tack. For all words w in a dictio he computes UE. om the result with the 
entries in the password file. finds an entry of rd file. he has determined the 
corresponding passwor is. therefore. reco S. use symbols such as $ or # in 
the passwords. Tle onary attacks angi Nie, but it is also harder to memorize 
the passwordgXIt isJałso possible to stor assword on a smart card. Instead of typing 
in his mie. e prover inserts his smart card into the smart card reader. The verifier 
reads the password from the smart card. There is no need for the user to memorize or even 
know the password. On the contrary, if the user does not know his password he cannot 
give it away. 

An attacker can also tap the connection between the prover and the verifier and can 
learn the password. This is particularly successful if there is a great distance between the 
prover and the verifier; for example, if a password system is used to protect World Wide 
Web access. Note that the use of smart cards does not prevent this attack. 

Finally, the attacker can also replace an entry f(w) in the password file with the im- 
age f(v) of his own password v. Then. using the password v. he can get access. There- 


fore. the password file must be write protected. 
One-Time Passwords 


Using passwords is dangerous because an attacker can learn the passwords by tapping 
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the connection between the prover and the verifier. With one-time passwords. this attack 
does not work. One-time passwords are used for one identification. For the next identifica- 
tion, a new one-time password is used. 

A simple way of implementing one-time passwords is the following. The verifier has a 
listf Cw), fw) . +t, f Go) of images of passwords w, **, w,. The prover knows 
this list of passwords and uses its elements for the identifications, Since the prover must 
store all passwords in advance. an attacker could learn some or all of them. 

It is also possible that the prover and the verifier share a secret function f of an initial 
string w. Then the one-time passwords are W;— f' (w), i :::: 0. The prover can put the 


current password W, and the one-way function f on a smart card. He does not need a large 


password file. 
Challenge-Response Identification SS 
SM that an attacker can 


hi 
learn passwords long before the actual PES. This is even true for one-time p 


Password identification system protocols 





word systems, Challenge response identifica systems do not have this problem, 
Alice wants to identify herself a challenge response system. Bob asks a ques- 


tion, the challenge. Alice computes response using ecret key and sends it to Bob, 


he same secret me rresponding public key. 
Symmetric Syste x V 


We descxi ple challenge resp ilentification system which uses a symmetric 
cryptosystem. SS 


are the same. Alice and Bob share a secret key &. Alice wants to identify herself to Bob. 





Bob verifies the response us 





ume that the encryption key and the corresponding decryption key 





Bob sends a random number r to Alice. Alice encrypts this random number by computingc 
=E,(r) and sends the ciphertext c to Bob. Bob decrypts the ciphertext; that is, he com- 
putes 7/— D, (c) and compares the result with his chosen random number r. If r=r , then 
he accepts the proof of identity; otherwise he rejects it. 

This protocol proves that Alice knows the secret key at the time of the identification. 
It is not possible for Bob or an attacker to compute or obtain the correct response in ad- 
vance. But since the verifier, Bob, also knows Alice's secret key, this key cannot be used 


for identification with another verifier since Bob can then pretend that he is Alice. 
Public-key Systems 


Challenge response systems can also be based on public-key signature schemes. If Al- 


ice wants to identify herself. she obtains a random number from Bob and signs this random 
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number with her private key. Bob verifies the signature, thereby verifying the identity of 
Alice. 
In this protocol, Bob cannot pretend that he is Alice. He only knows Alice’s public 





key. But it is necessary that Bob obtains the authentic public key of Alice. If the attacker, 
Oscar, can replace Alice’s public key with his own, then he can convince Bob that he is Al- 


ice. 


New Words and Phrases 


encryption n. JJ; 加 密 术 symmetric adj. 对称 的 ; 匀称 
dictionary attack 字典 攻击 cryptosyst 统 
verifier n RA: 核实 者 ; 校对 机 ciphertext < 
protocol n， 协 议 ; 草案 


Exercises N 
I. Answer the following que: em 


. What does M^ mean? ye 
2. What's the ur of implementin eMme lm 
3. What's ugue ntage of mà or system protocols? 
I. Trail te following "-— Chinese. 


1. Access to UNIX or Windows NT systems is typically controlled by password sys- 
tems. Each user picks his individual and secret password w. The computer stores the im- 
age f(w) of the password w under a one-way function f. If the user wants access to his 
computer, he enters his name and password w. The computer determines f(z) and com- 
pares this value with the stored value. If they are identical. then access is granted. Other- 


wise. the user is rejected. 




















2. An attacker can also tap the connection between the prover and the verifier and can 








A | 


learn the password. This is particularly successful if there is a great distance between the 





prover and the verifier; for example. if a password system is used to protect World Wide 


Web access. Note that the use of smart cards does not prevent this attack. 




















3. Password identification system protocols have the disadvantage that an attacker can 
learn passwords long before the actual identification. This is € ue for one-time pass- 


word systems. Challenge response identification systems t e this problem. 


Sr 
SF 


ae 
A erat "d 


On-line NS à Y 




















and the Hash-CBC Constructions 


We initiate a study of on-line ciphers. These are ciphers that can take input plaintexts 
of large and varying lengths and will output the i^ block of the ciphertext after having pro- 
cessed only the first i blocks of the plaintext. Such ciphers permit length-preserving en- 
cryption of a data stream with only a single pass through the data. We provide security def- 
initions for this primitive and study its basic properties. We then provide attacks on some 
possible candidates, including CBC with fixed IV. We then provide two constructions. 
HCBC1 and HCBC2. based on a given block cipher E and a family of computationally AXU 
functions. HCBC1 is proven secure against chosen-plaintext attacks assuming that E is a 
PRP secure against chosen-plaintext attacks. while HCBC2 is proven secure against cho- 


sen-ciphertext attacks assuming that E is a PRP secure against chosen-ciphertext attacks. 
Security Analysis of Randomize-hash-then-sign Digital Signatures 


At CRYPTO 2006. Halevi and Krawezyk proposed two randomized hash function 











modes and analyzed the security of digital signature algorithms based on these construc- 
tions. They showed that the security of signature schemes based on the two randomized 
hash function modes relies on properties similar to the second preimage resistance rather 
than on the collision resistance property of the hash functions. One of the randomized hash 
function modes was named the RMX hash function mode and was recommended for practi- 
cal purposes. The National Institute of Standards and Technology (NIST), U.S. A. 
standardized a variant of the RMX hash function mode and published this standard in the 
Special Publication (SP) 800-106. 

Tn this article, we first discuss a generic online birthday existential forgery attack of 
Dang and Perlner on the RMX-hash-then-sign schemes. We dos hat a variant of this at- 
tack can be applied to forge the other randomize-hash~ ik emes, We point out 
practical limitations of the generic forgery attack on the R sh-then-sign schemes. We 
then show that these limitations can be overcome WE. hash-then-sign schemes if it 
is easy to find fixed points for the underlying c essión functions. such as for the Da- 
vies-Meyer construction used in the Me tions such as MD5 designed by Rivest 
and the SHA family of hash functions 
USA and published by NIST in the Fe 


show an online birthday forg 
~ 


pe the National Security Agency (NSA), 







leon Processing Standards (FIPS). We 


er 








on this class ipse by using a variant of 
Dean's method of finding fi int expandable mesgages or hash functions based on the 
his forgery atta: ME 

X standardized in SP 800-106. We discuss some impor- 


based on the Me ol 
tant wre.’ War attacks and snl applicability on signature schemes based 


on hash functi 


Davies-Meyer construct so applicable to signature schemes 


with “built-in” randomization. Finally. we compare our attacks on ran- 
domize-hash-then-sign schemes with the generic forgery attacks on the standard hash-based 


message authentication code (HMAC). 
Security Models and Proof Strategies for Plaintext-aware Encryption 


Plaintext-aware encryption is a simple concept: a public-key encryption scheme is 
plaintext aware if no polynomial-time algorithm can create a ciphertext without "knowing" 
the underlying message. However. the formal definitions of plaintext awareness are com- 
plex. This paper analyses these formal security definitions and presents the only known vi- 
able strategy for proving a scheme is PA2 plaintext aware. At the heart of this strategy is a 
new notion called PA1 + plaintext awareness. This security notion conceptually sits be- 
tween PAI and PA2 plaintext awareness (although it is formally distinct from either of 
these notions). We show exactly how this new security notion relates to the existing no- 


tions and how it can be used to prove PA2 plaintext awareness. 
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Text: Document Delineation and XV equence Decoding 


Obtaining the Character Sequence in a M 


Digital documents that are the in 
or on a web server. The first step I 
ar sequence of characters. rg f plain Englis 
ial. But often things ge ay wy ore complex. Ty nce of characters may be encoded 
by one of various singjle-byte or -——2 s such as Unicode UTF-8'!! , or 
ador-specific en e need to determine the correct encoding. 


various Pc e 
This can be re ed as a machine learning classification problem. but is often handled by 


heuristic methods. user selection. or using provided document metadata. Once the enco- 







an ks process are typically bytes in a file 


essing is to coi this byte sequence into a line- 







ASCII encoding, this is triv- 





ding is determined, we decode the byte sequence to a character sequence. We might save 
the choice of encoding because it gives some evidence about what language the document is 
written in, 

The characters may have to be decoded out of some binary representation like Mi- 
crosoft Word DOC files and/or a compressed format such as zip files. Again, we must de- 
termine the document format. and then an appropriate decoder has to be used. Even for 
plain text documents. additional decoding may need to be done. In XML documents. 
character entities, such as &-amp; . need to be decoded to give the correct character. name- 
ly. & for &amp;. Finally, the textual part of the document may need to be extracted out 
of other material that will not be processed. This might be the desired handling for XML 
files. if the markup is going to be ignored; we would almost certainly want to do this with 


postscript or PDF files. We do not deal further with these issues in this book. and assume 
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Sy 
henceforth that our documents are a list of characters. Commercial products usually need 
to support a broad range of document types and encodings. because users want things to 
just work with their data as is. Often, they just think of documents as text inside applica- 
tions and are not even aware of how it is encoded on disk. This problem is usually solved 
by licensing a software library that handles decoding document formats and character en- 
codings. 

The idea that text is a linear sequence of characters is also called into question by some 
writing systems, such as Arabic. where text takes on some two-dimensional and mixed-or- 
der characteristics, as shown in Figure 9. 1 and Figure 9. 2. But. despite some complicated 
writing system conventions, there is an underlying sequence of sounds being represented 
and hence an essentially linear structure remains. This is $ f esented in the digital 


representation of Arabic, as shown in Figure 9. 1. 


(a e ^" 
了 
[kitabu SS x ok 
Figure 9.1 An example of a eei dard Arabic word, The writing is from right to left and 
letters undergo complex mutations as eat ‘The repi of short vowels (here, /i/ and / 
u/) and the final /n/ (nunation -deiris from strict lineari i represented as diacritics above and 
below letters, Nevertheless, presented text is stil ri linear ordering of characters representing 





sounds, Full vocaliza y as ere, normally apj the Koran and children's books, Day-to-day 
text is unvocal NS vowels are not repi t the letter for à would still appear) or partially vo- 
calized, with els inserted in places where {he writer perceives ambiguities, These choices add fur- 


ther complexities to indexing. 


pid dli AGS! ca Lle 132 ae 1962 Ai à oil jal cual 


> —— — START 


Figure 9.2 The conceptual linear order of characters is not necessarily the order that you see 
on the page. In languages that are written right to left, such as Hebrew and Arabic, it is quite com- 
mon to also have left-to-right text interspersed, such as numbers and dollar amounts, With modern 
Unicode representation concepts, the order of characters in files matches the conceptual order, and 
the reversal of displayed characters is handled by the rendering system, but this may not be true for 
documents in older encodings. 


“Algeria achieved its independence in 1962 after 132 years of French occupation. ” 
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Choosing a Document Unit 


The next phase is to determine what the document unit for indexing is. Thus unit far. 


we have assumed that documents are fixed units for the purposes of indexing. For exam- 





ple. we take each file in a folder as a document. But there are many cases in which you 
might want to do something different. A traditional UNIX (mbox-format) email file stores 
a sequence of email messages (an email folder) in one file. but you might wish to regard 
each email message as a separate document. Many email messages now contain attached 
documents, and you might then want to regard the email message and each contained at- 
tachment as separate documents. If an email message has an attached zip file, you might 
want to decode the zip file and regard each file it contains as e document. Going in 
tex2htmD take things that 
you might regard as a single document (e.g. , a Ponga ile or a LATEX document? 
and split them into separate HTML pages for e; lidWor subsection. stored as separate 
files. In these cases. you might want to WS 


More generally. for very long docu t 







the opposite direction. various pieces of web software (sui 







tiple files into a single document. 

e issue of indexing granularity arises. For 
a collection of books, it would usua ad idea to index an entire book as a document. 
A search for Chinese toys might up a book that viis China in the first chapter 
and toys in the last chapter this does not NE al 


may well wish to index %ch chapter or parag 


nt to the query. Instead, we 
a mini-document. Matches are then 
more likely to be ri ant and because theydoc ts are smaller it will be much easier for 
the user to PN NG: vant passages in éhedGoument. But why stop there? We could treat 
individual PA Ne as mini-documents. It becomes clear that there is a precision/recall 
tradeoff here. If the units get too small, we are likely to miss important passages because 
terms were distributed over several mini-documents. whereas if units are too large we tend 
to get spurious matches and the relevant information is hard for the user to find. The prob- 
lems with large document units can be alleviated by use of explicit or implicit proximity 
search and the tradeoffs in resulting system performance that we are hinting at are dis- 
cussed in later parts. The issue of index granularity, and in particular a need to simultane- 
ously index documents at multiple levels of granularity, appears prominently in XML re- 
trieval. An information retrieval (IR) system should be designed to offer choices of granu- 
larity. For this choice to be made well, the person who is deploying the system must have 
a good understanding of the document collection. the users. and their likely information 
needs and usage patterns. For now. we assume that a suitable size document unit has been 


chosen. together with an appropriate way of dividing or aggregating files. if needed. 
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New Words and Phrases 





retrieval n. 检索 perceive vt. & vi. WWA, HR., Mfg 
delineation n. 描述 ， 描 写 ambiguity n. 含糊 ， 歧 义 ， 模 棱 两 可 
byten. FW Hebrew n. AIHA. MKA. WARE 
trivial adj. BUNNY. JOOMANY intersperse vt. 散布 ， 散 置 
heuristic adj. JARS. RAW Unicode n， 统 一 码 ( 采 用 双 字 节 对 字符 进行 
metadata n， 元 数据 编码 ) 
entity n， 实 体 ， 本 质 folder n. 文件 夹 
license vt， 同 意 ， 发 许可 证 subsection n. rds 

Np. PIRE 


n. 

vocalize vt. & vi， 成 为 有 声 ， 以 声音 表示 出 us 7 i, 
mutation 2， 变 化 ， 转 变 spia 清楚 的 
vowel n， 元 音 , 母音 无疑 的 ,绝对 的 
nunation n. 词尾 加 “n”( 如 阿拉 伯 语 pos ity E, Wir; 接近 度 

名 词 的 词尾 变化 ) Eat adv, 显著 地 ， 重 要 地 
linearity n， 线 性 aggregate ot. & vi， 总 计 达 ;使 聚集 
diacritic adj， 可 区 别 的 ， E X 
Notes E id 


1l. UTR ok Transfori Format， 是 一 种 针对 Unicode 的 可 变 长 度 
字符 编码 ， SS 国 码 。 由 Ken Thompson F 1992 年 创建 。 现 在 已 经 标准 化 为 RFC 
3629。UTF-8 用 1 到 4 个 字 节 编码 Unicode 字符 。 用 在 网 页 上 可 以 同一 页 面 显 示 中 文 简 
体 、 繁 体 及 其 他 语言 (如 日 文 、 韩 文 ) 。 
2. XML; 可 扩展 标记 语言 ， 标 准 通 用 标记 语言 的 子 集 ， 一 种 用 于 标记 电子 文件 使 其 
具有 结构 性 的 标记 语言 。 它 可 以 用 来 标记 数据 、 定 义 数据 类 型 ， 是 一 种 允许 用 户 对 自己 的 
标记 语言 进行 定义 的 源 语言 。 它 非常 适合 万 维 网 传输 ， 提 供 统一 的 方法 来 描述 和 交换 独立 
于 应 用 程序 或 供应 商 的 结构 化 数据 。 






































Exercises 


I. Please translate the following words and phrases into Chinese. 


l. information retrieval 


2. character sequence 








. encoding scheme 


3 

4. vendor-specific standard 
5. heuristic methods 
6. binary representation 
7. HTML 

8. aggregating file 

9. linear sequence 

1 


0. document format 


I. Fill in the blanks with the missing word(s) from the table below. 
































punctuation sequence correspond) specified 
strategies collections additi ly hashing 
inefficient query er] contain 
inverted documents : - hrases material 
semantically throwing unstructured tokens 





ly documents) of an 


l. Information retrieval (IR) Ss finding ( 
nature Cusually text) that "pi information nei 
stored on computers). 以 

rieval can also omo 
Ae 






within large Cusually 







2. Informati nds of data and information problems 


一 M « » 
beyond that n the core definit ve. The term “unstructured data" refers to 


data that does ave clears overt, easy-for-a-computer structure. 


3. Given a character and a defined document unit. tokenization is the task of 
chopping it up into pieces. called tokens. perhaps at the same time away certain 
characters. such as p 

4. Having broken up our documents (and also our query) into + the easy 
case is if tokens in the just match tokens in the token list of the document. How- 
ever. there are many cases when two character sequences are not quite the same but you 
would like a match to occur. For instance. if you search for USA. you might hope to also 
match containing U. S. A. 

5. For grammatical reasons. documents are going to use different forms of a word, 
such as organize, organizes. and organizing. . there are families of derivationally 
related words with similar meanings, such as democracy, democratic, and democratization. 
In many situations, it seems as if it would be useful for a search for one of these words to 


return documents that another word in the set. 
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6. The of biword indexes and positional indexes can be fruitfully combined. 
If users commonly query on particular + such as Michael Jackson. it is quite 
to keep positional postings lists. A combination strategy uses a phrase 


index. or just a biword index. for certain queries and uses a positional index for other 
phrase queries. 

7. Given an index and a query. our first task is to determine whether each 
query term exists in the vocabulary and. if so. identify the pointer to the post- 
ings. This vocabulary lookup operation uses a classical data structure called the dictionary 


and has two broad classes of solutions: and search trees. 


I. Translate the following sentences into Chinese. 


1. Our final technique for tolerant retrieval has to d: phonetic correction; mis- 
spellings that arise because the user types a query t unds like the target term, Such 
algorithms are especially applicable to searches es of people. The main idea here 


is to generate, for each term, a “phonetic TN at similar-sounding terms hash to the 
a} 


2. One NSS compression is im! ely clear. We need less disk space. As we 
will see, comppesSion ratios of 1 : 4 are easy to achieve, potentially cutting the cost of sto- 


ring the index by 75%. 


same value. 



































3. There are two more subtle benefits of compression. The first is increased use of 
caching. Search systems use some parts of the dictionary and the index much more than 
others. The second more subtle advantage of compression is faster transfer of data from 
disk to memory. Efficient decompression algorithms run so fast on modern hardware that 
the total time of transferring a compressed chunk of data from disk and then decompressing 


it is usually less than transferring the same chunk of data in uncompressed form. 
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Reading: Search Structures for Dictionaries 


Given an inverted index and a query. our first task is to determine whether each query 
term exists in the vocabulary and, if so, identify the pointer to the corresponding postings. 
This vocabulary lookup operation uses a classical data struc; ed the dictionary and 
has two broad classes of solutions: hashing and search AS he literature of data struc- 
tures, the entries in the vocabulary (in our case, ter re\often referred to as keys. The 
choice of solution (hashing or search trees) is ey a number of questions, 

(D How many keys are we likely to hay the number likely to remain static, or 
change a lot? And in the case of changes,a MA to only have new keys inserted. or 


to also have some keys in the dictiona eleted? (3) What are the relative frequencies 





with which various keys will be dccéssed? XS 

Hashing has been use f ictionary lookup i Aa arch engines. Each vocabulary 
term (key) is hashed id orti over a large RA space that hash collisions are un- 
likely; collisions ew ed by auxiliary stru s that can demand care to maintain. At 
query time, ashJeach query term sep and, following a pointer to the correspond- 
ing postings N g into account any logic for resolving hash collisions. There is no easy 
way to find minor variants of a query term (such as the accented and unaccented versions of 
a word like resume). because these could be hashed to very different integers. In particu- 
lar. we cannot seek (for instance) all terms beginning with the prefix automat. Finally. in 
a setting (such as the Web). where the size of the vocabulary keeps growing. a hash func- 
tion designed for current needs may not suffice in a few years’ time. 

Search trees overcome many of these issues-for instance. they permit us to enumerate 
all vocabulary terms beginning with automat. The best-known search tree is the binary 
tree, in which each internal node has two children. The search for a term begins at the 
root of the tree. Each internal node (including the root) represents a binary test, based on 
whose outcome the search proceeds to one of the two subtrees below that node. Figure 9. 3 
gives an example of a binary search tree used for a dictionary. Efficient search (with a 
number of comparisons that is O(logM)) hinges on the tree being balanced; the numbers of 


terms under the two subtrees of any node are either equal or differ by 1. The principal is- 
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sue here is that of rebalancing; as terms are inserted into or deleted from the binary search 
tree, it needs to be rebalanced so that the balance property is maintained. 

To mitigate rebalancing. one approach is to allow the number of subtrees under an in- 
ternal node to vary in a fixed interval. A search tree commonly used for a dictionary is the 
B-tree—a search tree in which every internal node has a number of children in the interval 


[a, b]. where a and 









= 


A is example, the bra root partitions vocabulary terms into 


er is between a and E rest. 


b are ap X integers; Wage s. 4 shows an example with a=2 and 0 一 4. 
Each isch C 


queni 


two subtrees, those wh 


Er an internal node again represents a test for a range of character se- 





as in the binary tree example of Figure 9. 3. A B-tree may be viewed as "collap- 
sing" multiple levels of the binary tree into one; this is especially advantageous when some 
of the dictionary is disk resident. in which case this collapsing serves the function of pre- 
fetching imminent binary tests. In such cases, the integers a and b are determined by the 
sizes of disk blocks. 

It should be noted that. unlike hashing. search trees demand that the characters used 
in the document collection have a prescribed ordering; for instance. the 26 letters of the 
English alphabet are always listed in the specific order A through Z. Some Asian languages 
such as Chinese do not always have a unique ordering. although by now all languages (in- 
cluding Chinese and Japanese) have adopted a standard ordering system for their character 
sets. Wildcard queries are used in any of the following situations: (1) the user is uncertain 
of the spelling of a query term (e. g. . Sydney vs. Sidney. which leads to the wildcard que- 


ry Sdney); (2) the user is aware of multiple variants of spelling a term and (consciously) 
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Figure 9. 4 A B-tree. In this example every internal node has between 2 and 4 children, 


seeks documents containing any of the variants (e. g., color colour); (3) the user 
seeks documents containing variants of a term that would x by stemming, but is 


unsure whether the search engine performs stemming (e, icial vs, judiciary, leading 






to the wildcard query judicia); or (4) the user is unc the correct rendition of a for- 
eign word or phrase (e. g. . the query ipe 


A query such as mon is known as a ak card query, because the symbol occurs 


only once, at the end of the search st RN tree on the dictionary is a convenient 
way of handling trailing ean o e walk down the tree following the symbols m. 
o, and n in turn. at which point enumerate the si terms in the dictionary with 
the prefix mon. SNP 


Xa 

Finally. we use | Pon on the s Pi index to retrieve all docu- 
ments containing term in W. But M» Sita queries in which the * symbol 
is not constraih¢d to! be at the end of the Sith string? Before handling this general case. 
we mention NN t generalization of trailing wildcard queries. First. consider leading wild- 
card queries. or queries of the form * mon. Consider a reverse P-tree on the dictionary— 
one in which each root-to-leaf path of the P-tree corresponds to a term in the dictionary 
written backwards: thus. the term lemon would. in the B-tree, be represented by the 
pathroot-n-o-m-e-l. A walk down the reverse P-tree then enumerates all terms R in the vo- 
cabulary with a given prefix. 

In fact, using a regular B-tree together with a reverse B-tree. we can handle an even 
more general case: wildcard queries in which there is a single symbol. such as semon. To 
do this. we use the regular B-tree to enumerate the set W of dictionary terms beginning 


with the prefix se and a non-empty suffix. then the reverse B-tree to enumerate the set R of 





terms ending with the suffix mon. Next. we take the intersection W f R of these two 
sets, to arrive at the set of terms that begin with the prefix se and end with the suffix mon. 
Finally. we use the standard inverted index to retrieve all documents containing any terms 


in this intersection. We can thus handle wildcard queries that contain a single symbol using 











two B-trees, the normal B-tree and a reverse B-tree. 


New Words and Phrases 


inverted adj. 反 向 的 ,倒转 的 automat n. 自动 机 
hashing n， 散 列 法 subtree n， 子 树 ， 树 状 子 目录 
lookup n， 查 找 imminent adj. 即将 发 生 的 ， 人 迫切 的 
collision n. 碰撞 ， 冲 突 prescribe vt. & vi， 指定 ， 规 定 
variant n. GRDE; 变形 rendition n， 演 奏 ; 翻译 ; 给 予 

adj. 不 同 的 ， 相 异 的 trailing adj. €h; 草 延 的 
accented adj. WF ee 腔调 的 ; 带 …… H wildcard n. 

音 的 constrain Qiu m 

enumerate vt, JJA, BEE. "x 


Exercises ww 
I. Answer the ne» que: «e 


1. What does hashing BRE S ec xdi engines? 
2. What's the iain mitigate rebalanci 
3. What is ef arching and XE Us criteria? 


I. Tails e following sentences A Chinese. 


1. Given an inverted index and a query. our first task is to determine whether each 
query term exists in the vocabulary and. if so. identify the pointer to the corresponding 
postings. This vocabulary lookup operation uses a classical data structure called the dic- 


tionary and has two broad classes of solutions; hashing and search trees. 

















2. Hashing has been used for dictionary lookup in some search engines. Each vocabu- 
lary term (key) is hashed into an integer over a large enough space that hash collisions are 


unlikely; collisions are resolved by auxiliary structures that can demand care to maintain. 
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3. It should be noted that. unlike hashing. search trees demand that the characters 
used in the document collection have a prescribed ordering; for instance. the 26 letters of 
the English alphabet are always listed in the specific order A through Z. Some Asian lan- 
guages such as Chinese do not always have a unique ordering. although by now all langua- 


ges Cincluding Chinese and Japanese) have adopted a standard ordering system for their 
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In this p EU a perspective across multiple industry problems, including safe- 


ty and security, medical, Web, social and mobile media, and motivate the need for large- 


character sets. 

















scale analysis and retrieval of multimedia data. We describe a multi-layer architecture that 
incorporates capabilities for audio-visual feature extraction. machine learning and semantic 
modeling and provides a powerful framework for learning and classifying contents of multi- 
media data. We discuss the role semantic ontologies for representing audio-visual concepts 
and relationships. which are essential for training semantic classifiers. We discuss the im- 
portance of using faceted classification schemes in particular for organizing multimedia se- 
mantic concepts in order to achieve effective learning and retrieval. We also show how 
training and scoring of multimedia semantics can be implemented on big data distributed 
computing platforms to address both massive-scale analysis and low-latency processing. We 
describe multiple efforts at IBM on image and video analysis and retrieval. including IBM 
Multimedia Analysis and Retrieval System (IMARS). and show recent results for seman- 


tic-based classification and retrieval. We conclude with future directions for improving a- 
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nalysis of multimedia through interactive and curriculum-based techniques for multimedia 


semantics-based learning and retrieval. 
Improving Search Result Summaries by Using Searcher Behavior Data 


Query-biased search result summaries. or "snippets". help users decide whether a re- 
sult is relevant for their information need. and have become increasingly important for 
helping searchers with difficult or ambiguous search tasks. Previously published snippet 
generation algorithms have been primarily based on selecting document fragments most 
similar to the query. which does not take into account which pasts of the document the 
searchers actually found useful. We present a new approach tp ving result summaries 
by incorporating post-click searcher behavior data. such ds se cursor movements and 
scrolling over the result documents. To achieve this X develop a method for collec- 
ting behavioral data with precise association be Sédrcher intent, document examina- 
tion behavior, and the corresponding docume f ents, In turn, this allows us to incor- 
porate page examination behavior signal& in. novel Behavior-Biased Snippet generation 


system (BeBS). By mining searcher, e imation data. BeBS infers document fragments of 






most interest to users. and combi 


s his evidence wi features to select the 
most promising fragments usion in the < y. Our extensive experiments 


and analysis demonstra our method improves XXhe quality of result summaries com- 
pared to existing s of-the-art methods. Wie that this work opens a new direction 





一 


havior data u 


for MSS. ssult presentation; “We make available the code and the search be- 
s 


this study to encourage further research in this area. 
Modeling and Analysis of Cross-session Search Tasks 


The information needs of search engine users vary in complexity. depending on the 


task they are trying to accomplish. Some simple needs can be satisfied with a single query. 





whereas others require a series of queries issued over a longer period of time. While search 
engines effectively satisfy many simple needs. searchers receive little support when their 
information needs span session boundaries. In this work. we propose methods for modeling 
and analyzing user search behavior that extends over multiple search sessions. We focus on 
two problems: (i) given a user query. identify all of the related queries from previous ses- 
sions that the same user has issued. and (ii) given a multi-query task for a user. predict 
whether the user will return to this task in the future. We model both problems within a 


classification framework that uses features of individual queries and long-term user search 
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behavior at different granularity. Experimental evaluation of the proposed models for both 
tasks indicates that it is possible to effectively model and analyze cross-session search be- 
havior. Our findings have implications for improving search for complex information needs 


and designing search engine features to support cross-session search tasks. 





UNIT 
IMAGE PROCESSING 


Text: Basic Gray-level Ima Chocessing 


In this paper. we study basic gray-leve] d nage-" processing operations, The 
Me : 


dependencies I een neighboring pixels are not 


types of operations studied fall into three 










The first arepoint operations. o ng operations that are applied to indi- 





vidual pixels only. Thus. interac! 


considered, nor are operatio teonsider multiple imultaneously to determine an 





output. Since spatial in. a » such as a pixel" n and the values of its neighbors, 






ointJoperations are defin functions of pixel intensity only. The 


are not considered 
c 5 
ding, analyzing, 


basic tool for NS 
histogram. sh ill be introduced below! 


The second class includesarithmetic operations between images of the same spatial di- 


ning image point operations is the image 





mensions. These are also point operations in the sense that spatial information is not con- 
sidered. although information is shared between images on a pointwise basis. Generally. 
these have special purposes. e. g. . for noise reduction and change or motion detection. 
The third class of operations aregeometric image operations. These are complementa- 
ry to point operations in the sense that they are not defined as functions of image intensity. 
Instead. they are functions of spatial position only. Operations of this type change the ap- 
pearance of images by changing the coordinates of the intensities. This can be as simple as 
image translation or rotation. or it may include more complex operations that distort or 
bend an image. or “morph” a video sequence. Since our goal. however. is to concentrate 
on digital image processing of real-world images. rather than the production of special 
effects, only the most basic geometric transformations will be considered. More complex 


and time-varying geometric effects are more properly considered within the science of 
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computer graphics. 


Notation 


Point operations, algebraic operations. and geometric operations are easily defined on 
images of any dimensionality including digital video data. For simplicity of presentation, 
we will restrict our discussion to two-dimensional images only. The extensions to three or 
higher dimensions are not difficult. especially in the case of point operations. which are in- 
dependent of dimensionality. In fact. spatial temporal information is not considered in their 


definition or application. 


We will also only consider monochromatic images. sinc ions to color or other 
multispectral images is either trivial. in that the same op s are applied identically to 


each band (e. g.. R, G, B). or they are defined as complex color space operations, 






which goes beyond what we want to cover in this 


Suppose then that the single-valued nN, 


m snz). The image is assumed to be of fi- 


per. 


o be considered is defined on a two-di- 








mensional discrete-space coordinate sys: n 


nite support. with image domain | 0. x[0. M-1]. Hence the nonzero image data 


can be contained in a matrix or arkakoPdimensions N x Here columns). This discrete- 
space image will have origi: x y sampling a ci mage f(x. y). Furthermore. 
the image f(n) is Mus be quantized to K Jevals 10，… . K—1); hence each pixel 
value takes one deni teger values. Fo: VA we will refer to these values as gray 
levels, reflecting thd Away in which mo nati images are usually displayed. Since f 
GD is both PS ii and quantized, it is digital. 


Image Histogram 


The basic tool that is used in designing point operations on digital images (and many 
other operations as well) is the image histogram. The histogram H; of the digital image f 
is a plot or graph of the frequency of occurrence of each gray level in f. Hence. H; is a 
one-dimensional function with domain (0, ++ . K —1) and possible range extending from 
0 to the number of pixels in the image. NM. 

The histogram is given explicitly by 

H,G)—J (10-1) 
iff contains exactly J occurrences of gray level k. for each k 一 0，…, K —1. Thus. 
an algorithm to compute the image histogram involves a simple counting of gray levels. 


which can be accomplished even as the image is scanned. Every image processing develop- 
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ment environment and software library contains basic histogram computation. manipula- 
tion, and display routines. 

Since the histogram represents a reduction of dimensionality relative to the original im- 
agef. information is lost-the image f cannot be deduced from the histogram H; except in 
trivial cases (when the image is constant valued). In fact. the number of images that share 
the same arbitrary histogram H; is astronomical. Given an image f with a particular histo- 
gram H,, every image that is a spatial shuffling of the gray levels of f has the same histo- 
gram H,, 

The histogramH ; contains no spatial information about f—it describes the frequency 
of the gray levels in f and nothing more. However. this information is still very rich, and 


a simple visual display of H; reveals much about the i examining the appearance 


many useful image processing operations can be derived AN age histogram. Indeed. 

of a histogram. it is possible to ascertain whether vels are distributed primarily 

at lower (darker) gray levels, or vice versa. A this can be ascertained to some de- 

gree by visual examination of the image yaa he human eye has a tremendous ability to a- 
h 


dapt to overall changes in luminance. ay obscure shifts in the gray-level distribu- 


tion. The histogram supplies an l ethod of determjning an image's gray-level dis- 
tribution, a 


For example, the ave tical density, or is the basic measure of an image's 
overall average “多 ss 
= 


or it «an DY puted from the image histogram: 












gray level. It can We céniputed directly from the image: 


AOD(C/) = fO n) (10-2) 


ye iS 3 
AODA) = MN UAH) (10-3) 


The AOD is a useful and simple meter for estimating the center of an image's gray- 
level distribution. A target value for the AOD might be specified when designing a point 
operation to change the overall gray-level distribution of an image. 

Figurel depicts two hypothetical image histograms. The one on the left has a heavier 
distribution of gray levels close to zero (and a low AOD). while the one on the right is 
skewed toward the right (a high AOD). Since image gray levels are usually displayed with 
lower numbers' indicating darker pixels. the image on the left corresponds to a predomi- 
nantly dark image. This may occur if the image f was originally underexposed prior to dig- 
itization. or if it was taken under poor lighting levels. or perhaps the process of digitization 
was performed improperly. A skewed histogram often indicates a problem in gray-level al- 


location. The image on the right may have been overexposed or taken in very bright light. 
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New Words and Phrases 





spatial adj. 空间 的 ， 立 体 的 matrix n. JERE 

rotation n， 旋 转 ， 转 动 quantize vt。 量 化 

distort vt. & vi. jill. KA histogram n. 直方 图 

morph vt， 变 形 occurrence n， 发 生 ， 出 现 

pointwise adj. AH astronomical adj. 天 文学 的 ， 极 大 的 

complementary adj. 互补 的 ; 补充 的 ascertain vt. Jj. 确定 

geometric adj. 几何 的 tremendous adj. 大 的 ， 惊 人 的 

spatialtemporal n. 时 空 obscure adj. 不 清楚 的 

monochromatic adj. $h., HHY depict vt. 

multispectral acdj。 多 谱 线 的 ， 多 光谱 的 skew 

finite adj. 47 PRAY ew. (OR ABDEA AE 
n AREE; 有 限 的 事物 pose vi， 上 曝光 过 度 

nonzero n， 非 零 a 


Notes aS xe 


1. gray-level igus age: M nes sin 图 像 是 每 个 像素 只 有 一 个 采样 


颜色 的 图 像 。 pele cauaa KM ku etr init 白色 
亮度 为 255)， te 频 色 的 不 同 深浅 ， 甚 至 可 以 是 不 同 亮度 上 的 
不 同 颜色 。 象 与 黑白 图 像 不 同 ， 在 计算 机 图 像 领 域 中 黑白 图 像 只 有 黑白 两 种 颜色 ， 


灰 度 图 像 在 黑色 与 白色 之 间 还 有 许多 级 的 颜色 深度 。 但 是 ， 在 数字 图 像 领域 之 外 ,， “黑白 
图 像 ” 也 表示 “ 灰 度 图 像 *， 例 如 ， 灰 度 的 照片 通常 称 为 “黑白 照片 ”。 在 一 些 关 于 数字 图 
像 的 文章 中 单 色 图 像 等 同 于 灰 度 图 像 ， 在 另外 一 些 文章 中 又 等 同 于 黑白 图 像 。 














Exercises 


I. Please translate the following words and phrases into Chinese. 


. point operation 
geometric operation 
arithmetic operation 


. image histogram 


Tp 


. motion detection 
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6. noise reduction 

7. geometric image operation 
8. time-varying 

9. algebraic operation 

10. discrete-space image 


ll. frequency of occurrence 


I. Fill in the blanks with the missing word(s) from the table below. 


zoomed 


geometric E 


gross 


shape 





gray 


multiple 


representations 


visual 





transitions 


redundancies 


coordina 


surveillance 





transform 


boundary 


equalization 





separation 


edges 


scene 


























magnific: pixel motion 
In some + it is desir the image into one that has a histogram 
of a specific . The procesof{ histogram Mo &eneralizes histogram 3 
which is the special case in thé target shape is L^ ME shaping can be applied 
when . images e scene, but take’ er mildly different lighting condi- 


tions. are to be os 


2. Dete is also usefu: are targets，for recognizing objects by 
their NE or computing treedin isional 
sional motion. 


3. If the time 


information from two-dimen- 


between frames is not small. then change detection can in- 
volve the discovery of scene changes. This can be useful for security or 
cameras. or in automated inspection systems. for example. 
4. The image zoom is a good example of a operation for which the type of in- 
terpolation is important. particularly at high . With nearest neighbor interpola- 


tion, many values in the image may be assigned the same scale. resul- 
ting in a severe "blotching" or "blocking" effect. 

5. At first glance. detection may seem trivial. since the boundary points 
can be simply defined as the from 1 to 0 (and vice versa). However. when there 
is noise present. boundary detection becomes quite sensitive to small noise artifacts. lead- 
ing to many useless detected : 


6. Compressed images are that require less storage than the nominal stor- 
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age. This is generally accomplished by coding of the data based on measured + re 
arrangement of the data to exploit patterns and in the data. and (in the case of 
lossy compression) . quantization of information. 

7. 'The basic idea of chain coding is to code contour directions instead of bit- 
by-bit binary image coding or even representations of the contours. Chain coding 
is based on identifying and storing the directions from each to its neighbor pixel 


on each contour. 
II. Translate the following sentences into Chinese. 


l. Images that occur in practical applications invariably suffer from random degrada- 


tions that are collectively referred to as noise. These degradat arise from numerous 






sources. including radiation scatter from the surface before image is sensed; electrical 


noise in the sensor or camera; channel noise as Ws is i is transmitted over a communica- 


tion channel; bit errors after the image is m 
s f 


2. Geometric ima xA opposite of point operations; they 


are, ina s 
modify the spatia ^ nil spatial ri Nea of pixels. but they do not modify 


graylevel NP 


























je E these operatlófi$ Can be quite complex and computationally in- 
tensive, especi AW i. applied to video sequences. However, the more complex geomet- 
ric operations are not much used in engineering image processing, although they are heavily 


used in the computer graphics field. 

















3. A simple but powerful tool for identifying and labeling the various objects in a bi- 
nary image is a process called region labeling. blob coloring. or connected component iden- 
tification. It is useful since once they are individually labeled. the objects can be separately 


manipulated. displayed. or modified. 
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Reading: Basic Binary Image Processing 


In this paper on basic methods. we explain and demonstrate fundamental tools for the 
processing of binary digital images. Binary image processing is of special interest. since an 
image in binary format can be processed with very fast logical (Boolean) operators. Often. 


age. such as object location, object boundaries, or the préseRé« or absence of some image 


a binary image has been obtained by abstracting essential A from a gray-level im- 
property. 


As we know. a digital image is an array of XXE r sampled image intensities, Each 
SNof 


gray level is quantized or assigned one of afi i eNOof numbers represented by B bits. In a 
binary image, only one bit is assigned to/eac el: B—1. implying two possible gray-lev- 


el values. 0 and 1. These two values 









Sually interpreted as Boolean; hence each pixel 
can take on the logical values, 0 or syn uer or "false". For example. 
these values might indicate gin. or presence, niesimage property in an associated 


gray-level image of te Me > where 1 at coordinate indicates the presence of 
the property at th rdinate in the meu v4 and 0 otherwise. This image prop- 


erty is quite 
stract properties," 


smoothness, etc. . might be indicated. 








sufficiently hig! [ow intensity (brightness), although more ab- 








ch as the presence or absence of certain objects, or smoothness or non- 





Since most image display systems and software assume images of eight or more bits 
per pixel, the question arises as to how binary images are displayed. Usually, they are dis- 
played using the two extreme gray tones, black and white, which are ordinarily represen- 
ted by 0 and 255, respectively, in a gray-scale display environment. There is no established 
convention for the Boolean values that are assigned to “black” and to “white”. In this paper 
we will uniformly use 1 to represent black (displayed as gray-level 0) and 0 to represent 
white (displayed as gray-level 255). However. the assignments are quite commonly re- 
versed, and it is important to note that the Boolean values 0 and 1 have no physical signifi- 
cance other than what the user assigns to them. 

Binary images arise in a number of ways. Usually. they are created from gray-level 
images for simplified processing or for printing. However. certain types of sensors directly 


deliver a binary image output. Such devices are usually associated with printed. handwrit- 








ten. or line drawing images. with the input signal being entered by hand on a pressure sen- 
sitive tablet. a resistive pad. or a light pen. In such a device. the (binary) image is first 
initialized prior to image acquisition: 
g(n)=0 (10-4) 
at all coordinates n. When pressure, a change of resistance, or light is sensed at some 
signed the value 1: 
g(m)=1 (10-5) 


This continues until the user completes the drawing. These simple devices are quite 


image coordinate n, then the image is 





useful for entering engineering drawings. handprinted characters, or other binary graphics 


in a binary image format. 

Image Thresholding | g 

Usually. a binary image is obtained from a gra’ «as; by some process of infor- 
mation abstraction. The advantage of the B-fol SE in the required image storage 
space is offset by what can be a MN. YS formation in the resulting binary im- 


age. However. if the process is accomp| helk h care, then a simple abstraction of infor- 


T 






mation can be obtained that can enhan Sequent processing, analysis, or interpretation 


of the image. 


The simplest such RENAE is the MM thresholding. which can be 
thought of as an miami of gray-level qu: X n. Suppose that a gray-level image f 
can take K possibje gray ay levels 0. 1. 2. . Define an integer threshold. T. that 

[^ di of T E (0, i EN eso. 


lies in the gr q 
The process 6f thresholding is a process of simple comparison: each pixel value inf is 
compared to T. Based on this comparison. a binary decision is made that defines the value 
of the corresponding pixel in an output binary image g: 
0 if>f@>T 
8g(n) = . 
l aftf@)<T 


Of course, the threshold T that is used is of critical importance. since it controls the 


(10-6) 


particular abstraction of information that is obtained. Indeed. different thresholds can pro- 
duce different valuable abstractions of the image. Other thresholds may produce little valu- 
able information at all. It is instructive to observe the result of thresholding an image at 
many different levels in sequence. 

As will be seen. image thresholding can often produce a binary image result that is 
quite useful for simplified processing. interpretation. or display. However. some gray-lev- 
el images do not lead to any interesting binary result regardless of the chosen thresholdT. 


Several questions arise: Given a gray-level image. how does one decide whether binari- 
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zation of the image by gray-level thresholding will produce a useful result? Can this be de- 
cided automatically by a computer algorithm? Assuming that thresholding is likely to be 
successful. how does one decide on a threshold level T? These are apparently simple ques- 
tions pertaining to a very simple operation. However. these questions turn out to be quite 


difficult to answer in the general case. In other cases. the answer is simpler. In all cases. 





however, the basic tool for understanding the process of image thresholding is the image 
histogram. 

Thresholding is most commonly and effectively applied to images that can be charac- 
terized as having bimodal histograms. 


clear modes; the one at the right either has a single mode, heavily overlapping. 






Figure 10. 1 depicts two hypothetical image histograms. bow on the left has two 


poorly separated modes, Bimodal histograms are often € t always) associated with 


images that contain objects and backgrounds having Qs antly different average bright- 
ness, This may imply bright objects on a NS nd. or dark objects on a bright 


background. m X 
Threshold T ^K 








Hk) Hak) 
0 | 大 天 -1 0 gray level k K-1 
AS (a) X [2 
Figure 10.1 H; cal listograms; (a) well fed modes and (b) poorly separated or indistinct modes, 


The goal,4 many applications. is to separate the objects from the background, and to 
label them as object or as background. If the image histogram contains well-separated 
modes associated with an object and with a background. then thresholding can be the 
means for achieving this separation. Practical examples of gray-level images with well-sep- 
arated bimodal histograms are not hard to find. For example. an image of machine-printed 
type (like that being currently read). or of handprinted characters. will have a very dis- 
tinctive separation between object and background. Examples abound in biomedical appli- 
cations, where it is often possible to control the lighting of objects and background. Stand- 
ard bright-field microscope images of single or multiple cells (micrographs) typically con- 
tain bright objects against a darker background. In many industry applications. it is also 
possible to control the relative brightness of objects of interest and the backgrounds they 
are set against, For example. machine parts that are being imaged (perhaps in an automa- 


ted inspection application) may be placed on a mechanical conveyor that has substantially 








different reflectance properties than the objects. 


New Words and Phrases 


binary digital image 二 值 数字 图 像 

Boolean adj. 布尔 数学 的 

interpret vt. MFE., HE, AF 
equivalently adv. 等 价 地 

coordinate vt. & vi， 使 协调 ， 使 调和 ; 整合 
pressure sensitive tablet 压 敏 片 

initialize vt. MAE 

acquisition n， 获 得 ; 取得 

thresholding n. fÉ 


Exercises 





ges? 


3. Wher 


I. Answer the UE ions. 
1. What are demo undamental to 


2. m mn dyed OX 


binarization n。 二 值 化 

pertain vi. XF. AK; WA; 附属 
bimodal adj. JUH 

depict vt. $22. HE 

overlapping n. Wd. 44% 

label n. 标签 
贴 标 


S 
NF 


at 





vt. 


processing of binary digital ima- 


binary image obtained from? 


I. Translate the following sentences into Chinese. 


l. Binary image processing is of special interest. since an image in binary formatcan 


be processed with very fast logical (Boolean) operators. Often. a binary image has been 


obtained by abstracting essential information from a gray-level image. such as object loca- 


tion. object boundaries. or the presence or absence of some image property. 

















2. Since most image display systems and software assume images of eight or more bits 


per pixel. the question arises as to how binary images are displayed. Usually. they are dis- 
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played using the two extreme gray tones. black and white. which are ordinarily represen- 


ted by 0 and 255. respectively. in a gray-scale display environment. 

















3. Usually. a binary image is obtained from a gray-level image by some process of in- 
formation abstraction. The advantage of the B-fold reduction in the required image storage 


space is offset by what can be a significant loss of information in the resulting binary im- 





age. However. if the process is accomplished with care. then, aS abstraction of infor- 





mation can be obtained that can enhance subsequent processi analysis. or interpretation 





of the image. 

















E X- m ng 
HA 
Full-reference Quality Estimation for Images with Different Spatial Resolutions 


Multimedia communication is becoming pervasive because of the progress in wireless 
communications and multimedia coding. Estimating the quality of the visual content accu- 
rately is crucial in providing satisfactory service. State of the art visual quality assessment 
approaches are effective when the input image and reference image have the same resolu- 
tion. However. finding the quality of an image that has spatial resolution different than 
that of the reference image is still a challenging problem. To solve this problem. we devel- 
op a quality estimator (QE), which computes the quality of the input image without resam- 
pling the reference or the input images. In this paper. we begin by identifying the potential 
weaknesses of previous approaches used to estimate the quality of experience. Next. we 
design a QE to estimate the quality of a distorted image with a lower resolution compared 


with the reference image. We also propose a subjective test environment to explore the suc- 
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cess of the proposed algorithm in comparison with other QEs. When the input and test im- 
ages have different resolutions. the subjective tests demonstrate that in most cases the pro- 
posed method works better than other approaches. In addition. the proposed algorithm al- 


so performs well when the reference image and the test image have the same resolution. 


Segmentation-driven Image Registration-application to 4D DCE-MRI Recordings 
of the Moving Kidneys 


Dynamic contrast enhanced magnetic resonance imaging (DCE-MRD of the kidneys re- 
quires proper motion correction and segmentation to enable an estimation of glomerular fil- 


tration rate through pharmacokinetic modeling. Traditionall istration, segmenta- 






tion, and pharmacokinetic modeling have been applied se ially as separate processing 
steps. In this paper. a combined 4D model for inia registration and segmentation 
of the whole kidney is presented. To demonstra: de 
used normalized gradients as data term in SA 

e 


the time courses of the segmented regioné ae 


l in numerical experiments, we 
ition and a Mahalanobis distance from 


ining set for supervised segmentation, By 


applying this framework to an input,c Thg of 4D image time series. we conduct simul- 

taneous motion correction and, t ion E and background. The 
~ 

RI data from ten healthy vol- 


potential of the new approa: lemonstrated on re 
unteers. 


Ns ite for Spec agery Using Sparse Spatiospectral 


Masks 


Two model-based algorithms for edge detection in spectral imagery are developed that 
specifically target capturing intrinsic features such as is oluminant edges that are character- 
ized by a jump in color but not in intensity. Given prior knowledge of the classes of reflec- 
tance or emittance spectra associated with candidate objects in a scene. a small set of spec- 
tral-band ratios. which most profoundly identify the edge between each pair of materials. 
are selected to define a edge signature. The bands that form the edge signature are fed into 
a spatial mask. producing a sparse joint spatiospectral nonlinear operator. The first algo- 
rithm achieves edge detection for every material pair by matching the response of the opera- 


tor at every pixel with the edge signature for the pair of materials. The second algorithm is 





a classifier-enhanced extension of the first algorithm that adaptively accentuates distinctive 








features before applying the spatiospectral operator. Both algorithms are extensively veri- 


fied using spectral imagery from the airborne hyperspectral imager and from a dots-in-a- 











well midinfrared imager. In both cases. the multicolor gradient (MCG) and the hyperspec- 
tral/spatial detection of edges ( HySPADE) edge detectors are used as a benchmark for 
comparison. The results demonstrate that the proposed algorithms outperform the MCG 
and HySPADE edge detectors in accuracy. especially when isoluminant edges are present. 
By requiring only a few bands as input to the spatiospectral operator, the algorithms enable 
significant levels of data compression in band selection. In the presented examples. the re- 
quired operations per pixel are reduced by a factor of 71 with respect to those required by 
the MCG edge detector. 
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Text: Video PR 


homogeneous in some sense. Different feat NS homogeneity criteria generally lead to 


Video segmentation refers to the identifi eR N in a frame of video that are 


different segmentations of the same r mem color segmentation. texture seg- 


mentation, and motion ene 1 Ñi, result in ent segmentation maps. Fur- 







thermore, there is no carey hat any of the result entations will be semantical- 
ly meaningful. since a antic Ki, meaningful regu ay have multiple colors, multiple 
textures, or multi Nee NY 

In this pa X are primarily co with labeling independently moving image 
regions (motis SALAM or semantically meaningful image regions (video object 


plane segmentation). Motion segmentation (also known as optical flow segmentation) 
methods label pixels Cor optical flow vectors) at each frame that are associated with inde- 
pendently moving part of a scene. These regions may or may not be semantically meaning- 
ful. For example. a single object with articulated motion may be segmented into multiple 
regions. Although it is possible to achieve fully automatic motion segmentation with some 
limited accuracy. semantically meaningful video object segmentation generally requires user 
to define the object of interest in at least some key frames. Motion segmentation is closely 
related to two other problems. motion (change) detection and motion estimation. Change 
detection is a special case of motion segmentation with only two regions. namely changed 
and unchanged regions (in the case of a static camera) or global and local motion regions 
Cin the case of a moving camera). An important distinction between change detection and 
motion segmentation is that the former can be achieved without motion estimation if the 


scene is recorded with a static camera. Change detection in the case of a moving camera and 
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general motion segmentation. in contrast. require some sort of global or local motion esti- 
mation. either explicitly or implicitly. Motion detection and segmentation are also plagued 
with the same two fundamental limitations associated with motion estimation; occlusion 
and aperture problems. 

For example, pixels in a flat image region may appear stationary even if they are mov- 
ing as a result of an aperture problem (hence the need for hierarchical methods) ; or errone- 
ous labels may be assigned to pixels in covered or uncovered image regions as a result of an 
occlusion problem. 

It should not come as a surprise that motion/object segmentation is an integral part of 
many video analysis problems. including (i) improved motion (optical flow) estimation, 


Gi) three-dimensional (3-D) motion and structure estimatio) presence of multiple 








moving objects. and (iii) description of the temporal vari Sor the content of video. In 


the former case. the segmentation labels help to ide: ical flow boundaries (motion 


edges) and occlusion regions where the smoothne$ raint should be turned off. Seg- 
mentation is required in the second case. bec: tinct 3-D motion and structure parame- 
ters are needed to model the flow vect Read with each independently moving ob- 
ject. Finally, in the third case. se; ion information may be employed in an object- 


as opposed xpo description provided 


As with any segmentation problem. propenf c selection facilitates effective motion 
segmentation. In al J the application e 
— 


` 
~ 


level description of e D 


by individual flow vectors. 


rd image segmentation methods directly 
to estimated w vectors may no meaningful results, since an object moving 
in 3-D usually erates a spatially varying optical flow field. For example. in the case of a 


rotating object. there is no flow at the center of the rotation. and the magnitude of the flow 





vectors grows as we move away from the center of rotation. Therefore, a parametric mod- 
el-based approach. where we assume that the motion field can be described by a set of K 
parametric models. is usually adopted. In parametric motion segmentation. the model pa- 
rameters are the motion features. 

Then, motion segmentation algorithmsaim to determine the number of motion models 
that can adequately describe a scene. type/complexity of these motion models. and the spa- 
tial support of each motion model. The most commonly used types of parametric models 
are affine. perspective. and quadratic mappings. which assume a 3-D planar surface in mo- 
tion. In the case of a nonplanar object. the resulting optical flow can be modeled by a 
piecewise affme. perspective. or quadratic flow field if we approximate the object surface 
by a union of a small number of planar patches. Because each independently moving object 


or planar patch will best fit a different parametric model. the parametric approach may lead 








NH il MM | 


to an oversegmentation of motion in the case of nonplanar objects. 

It is difficult to associate a generic figure of merit with a video segmentation result. If 
segmentation is employed to improve the compression efficiency or rate control. then over- 
segmentation may not be a cause of concern. The occlusion and aperture problems are 
mainly responsible for misalignment of motion and actual object boundaries. Furthermore, 
model misfit possibly asa result of a deviation of the surface structure from a plane general- 
ly leads to oversegmentation of the motion field. 

In contrast. if segmentation is needed for object-based editing and composition as in 
the upcoming MPEG-45 standard. then it is of utmost importance that the estimated 


boundaries align with actual object boundaries perfectly. Even a si Fo pixel error may not 


be tolerable in this case. Although elimination of outlier i faints and imposing 
spatio temporal smoothness constraints on the ane p improve the chances of 
obtaining more meaningful segmentation results, MS bject segmentation in general 
requires specialized capture methods (chroma k: xe interaction ( semi-automatic 
methods). 


New Words and Phrases aw 

segmentation n. 4) il ; wey T m 错误 自 

frame n. HEME, fn al adj. MH 暂 存 的 
同类 的 


















































homogeneous adj E eter n. BR 
criteria n， 标 (criterion ROT A facilitate vi. fe. WK 
semantically a 语义 地 magnitude n. EK. EK; 量 级 
optical adj. 视觉 的 ， 光 学 的 parametric adj. 参数 的 ， 参 量 的 
vector n， 向 量 affine adj. 仿 射 的 
articulate adj. 发 音 清晰 的 ; 善于 表达 的 ; quadratic adj. 二 次 的 

有 关节 的 n， 二 次 方程 式 

vt. 清晰 地 发 ( 音 ); 言语 表达 ; nonplanar adj. 非 平面 的 ， 空 间 的 

(用 关节 ) 连 接 planar adj. 平面 的 
explicitly acw。 明 白地 ， 明 确 地 oversegmentation n. 重复 分 割 
implicitly adv. 含蓄 地 ， 暗 示 地 merit n. 价值 ， 优 点 
plague vt. fier. 造成 麻烦 vi. (HS. ARI 
occlusion n. FASE; 咬合; 堵塞 misalignment n. XNE., WH 
aperture n. fL, ib]; 光圈 misfit vt. XJ 不 适合 ; 不 适用 于 
stationary adj. 静止 的 , 平衡 的 n RER 


hierarchical adj. 分 层 的 deviation n. 1$ 85. WE. W22 
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chroma n. (AE. (fl AE 
Notes 


1, MPEG-4; MPEG T 1999 年 2 月 正式 公布 了 MPEG-4 (ISO/IEC 14496) 标 准 第 一 
版 本 。 同 年 年 底 MPEG-4 第 二 版 问世 ， 且 于 2000 年 年 初 正式 成 为 国际 标准 。MPEG-4 与 
MPEG-1 和 MPEG-2 有 很 大 的 不 同 。MPEG-4 不 只 是 具体 的 压缩 算法 ， 它 是 针对 数字 电 
视 、 交 互 式 绘图 应 用 (影音 合成 内 容 )、 交 互 式 多 媒体 (WWW、 资 料 搬 取 与 分 散 ) 等 整合 及 
压缩 技术 的 需求 而 制定 的 国际 标准 。MPEG-4 标准 将 众多 多 媒体 应 用 集成 于 一 个 完整 框架 
内 ， 旨 在 为 多 媒体 通信 及 应 用 环境 提供 标准 算法 及 工具 ， 从 而 建立 起 一 种 能 被 多 媒体 传 
输 、 存 储 、 检 索 等 应 用 领域 普遍 采用 的 统一 数据 格式 。 «e 


Exercises gS 
I. Please translate the following K abu ass into Chinese. 














motion segmentation 


video object plane E X» x 


optical flow vector, 以 
static camer gA V 
"der ia X 
neg T 

chromá keying 


semi-automatic method 


$e iw qe qm ge ge dS GER 


I. Fill in the blanks with the missing word(s) from the table below. 























semantically representation semantic classified 
features automatic clustering intensity 
samples occurrence technology values 
alternative flow definition texture 
1. The Hough transform is a well-known technique in which the data 
“vote” for the most representative feature in a quantized feature space. 
2. Motion segmentation approaches in general are as optical seg- 





mentation methods. which operate on precomputed optical flow estimates as an input 
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+ and direct methods. which operate on spatiotemporal values. 
3. It is difficult to achieve meaningful object segmentation by using fully 
methods based on low-level features such as motion. color. and . This 
is because a object may contain multiple motions. colors. textures. and so on. 
and the of semantic objects may depend on the context. which may not be possi- 
ble to capture by using low-level 
4. Chroma keying is an object-based video in which each video object is cap- 
tured individually in a special studio against a key color. 


5. Because chroma keying requires special studios or equipment to capture video ob- 


jects. an approach is interactive segmentation. using automated tools to aid a hu- 
man operator. To this effect, we assume that the contour of t st of the se- 
mantic object of interest is marked interactively by a hum rator. 

II. Translate the following sentences into Chi X 

l. Segmentation by dominant motion analyst fers to extracting one object (with the 
dominant motion) from the scene at a tithe. inant motion segmentation can be consid- 
ered as a hierarchically structured t approach, which starts by fitting a single para- 


metric motion model to the entiri e. and then dette frame into two regions. 
~ 
those pixels that are well ee by this A otion model and those that 
are not. one NS 
DNA 3 BE 


2. Multiple object segmentation can be achieved by repeating the same procedure on 

















the residual image after each object is extracted. Once the first dominant object is segmen- 
ted and tracked, the procedure can be repeated recursively to segment and track the next 
dominant object after excluding all pixels belonging to the first object from the region of a- 
nalysis. Hence, the method is capable of segmenting multiple moving objects in a top-down 


fashion if a dominant motion exists at each stage. 
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3. Multiple motion segmentation methods let multiple motion models compete against 





each other at each decision site. They consist of three basic steps. which are strongly inter- 
related; estimation of the number K of independent motions. estimation of model parame- 


ters for each motion. and determination of support of each model ( segmentation labels). 

















Reading: Video Compression Pun 


Awide variety of digital video acd exist. They range from simple 
low-resolution and low-bandwidth RO imedia, picturephone) to very high- 


resolution and high-bandwidth (HDTV . This paper will present requirements of 


current and future digital video applic% co i demands they place on the video com- 


importance of Mrs the transmission of dig- 
n 


pression system. 


Asa way to demons] 
required by a digital television sig- 


ital video television signals is presented. The b: 
nal is approximate e-balf the number ofgi jc elements (pixels) displayed per second. 


is the distance between scanning lines. and 


The analog pi du n the vertical n s 
the horizontal ension is the distance the scanning spot moves during 1/2 cycle of the 


highest video signal transmission frequency. The bandwidth is given by: 


By = 0. 8 FO (Mi) (Ry) (1-1) 


where 

Bw — system bandwidth. 

Fy — number of frames transmitted per second (fps). 

N, —number of scanning lines per frame, 

Ry — horizontal resolution (lines), proportional to pixel resolution. 

The National Television Systems Committee (NTSC) aspect ratio is 4/3, the constant 
0. 5 is the ratio of the number of cycles to the number of lines. and the factor 0. 84 is the 
Íraction of the horizontal scanning interval that is devoted to signal transmission. 


The NTSC transmission standard used for television broadcasts in the United States 


has the following parameter values: Fr=29. 97 fps. N, —525 lines. and Ry —340 lines. 


This yields a video system bandwidth Bw of 4. 2 MHz for the NTSC broadcast system. In 
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order to transmit a color digital video signal. the digital pixel format must be defined. The 
digital color pixel is made of three components: one luminance (Y) component occupying 8 
bits. and two color difference components (U and V) each requiring 8 bits. The NTSC pic- 
ture frame has 720X480 2 total luminance and color pixels. In order to transmit this in- 


formation for an NTSC broadcast system at 29. 97 frames/s. the following bandwidth is re- 








quired 
bitrate 
Digital Bw ~ ELE 
(29. 97 fps) X (24 bits/pixeD X (720 X 480 X 2 pixels/frame) 
2 
— 249MHz (11-2) 
This represents an increase of 一 59 times the availab D> can and ~41 
times the full transmission channel bandwidth (6 Mo rrent NTSC signals. HDTV 
picture resolution requires up to three times more un than this example! CTwo 
transmission channels totaling 12 MHz are alloc r ty ae HDTV transmissions. ) 


It is clear from this example that terres elevision broadcast systems will have to use 







digital transmission and digital video ssion to achieve the overall bitrate reduction 


and image quality required for "QR h v 
The example not is üt the "a idth requirements for digital 
heiss 


video information, but 





ectly brings up t e of digital video quality require- 


ments, The trade en tac Mm and ae istortion is a fundamental issue facing 
the design NG mpression systems 
To this AÀ NE is important to fully chdracterize an application’s video communications 


requirements before designing or selecting an appropriate video compression system. Fac- 
tors that should be considered in the design and selection of a video compression system in- 
clude the following items. 

1. Video characteristics: video parameters such as the dynamic range, source statis- 
tics. pixel resolution. and noise content can affect the performance of the compression sys- 
tem, 

2. Transmission requirements; transmission bitrate requirements determine the pow- 
er of the compression system. Very high transmission bandwidth. storage capacity. or 
quality requirements may necessitate lossless compression. Conversely. extremely low bi- 
trate requirements may dictate compression systems that trade off image quality for a large 
compression ratio. Progressive transmission is a key issue for selection of the compression 
system. It is generally used when the transmission bandwidth exceeds the compressed vide- 


o bandwidth. Progressive coding refers to a multiresolution. hierarchical. or subband en- 
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coding of the video information. It allows for transmission and reconstruction of each reso- 
lution independently from low to high resolution. In addition. channel errors affect system 
performance and the quality of the reconstructed video. Channel errors can affect the bit 
stream randomly or in burst fashion. The channel error characteristics can have different 
effects on different encoders, and they can range from local to global anomalies. In gener- 
al. transmission error correction codes (ECC) are used to mitigate the effect of channel er- 
rors, but awareness and knowledge of this issue is important. 

3. Compression system characteristics and performance: the nature of video applica- 


tions makes many demands on the video compression system. Interactive video applications 









such as videoconferencing demand that the video compression systems have symmetric ca- 


pabilities. That is. each participant in the interactive video sı st have the same vid- 
eo encoding and decoding capabilities. and the system p nce requirements must be 
met by both the encoder and decoder. In contrast. isign broadcast video has signifi- 
cantly greater performance requirements at the t. itter because it has the responsibility 
of providing real-time high quality MSS. that meets the transmission channel 
capacity. Digital video system implemenfatioR requirements can vary significantly. Desktop 


televideo conferencing can be impleme y using software encoding and decoding. or it 





may require specialized hardwar: transmission cos to provide a high-quality 
performance. The character of the applicatio: te the suitability of the video 
compression algorithm rticular system i tations. The importance of the en- 


coder and system eméntation decision e overstated; system architectures and 





performance D s are changing aci pace and the choice of the best solution re- 
quires sarei ysis of the all possible system and encoder alternatives. 


4. Rate-distortion requirements; the rate-distortion requirement is a basic considera- 





tion in the selection of the video encoder. The video encoder must be able to provide the bi- 
trate(s) and video fidelity (or range of video fidelity) required by the application. Other- 
wise. any aspect of the system may not meet specifications. For example. if the bitrate 
specification is exceeded in order to support a lower MSE. a larger than expected transmis- 
sion error rate may cause a catastrophic system failure. 

5. Standards requirements: video encoder compatibility with existing and future 
standards is an important consideration if the digital video system is required to interoper- 
ate with existing or future systems. A good example is that of a desktop videoconferencing 
application supporting a number of legacy video compression standards. This results in re- 
quiring support of the older video encoding standards on new equipment designed for a ne- 
wer incompatible standard. Videoconferencing equipment not supporting the old standards 


would not be capable or as capable to work in environments supporting older standards. 
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New Words and Phrases 














vertical adj. 垂直 的 ， 直 立 的 multiresolution 2。 多 分 辩 率 
n EHR., wA subband n. 部 分 波段 ; 次 能 带 
horizontal adj. 水 平 的 ; 地 平 线 的 ; 同一 阶 anomaly n. 异常 ; 不 规则 ; 反常 事物 
层 的 ECC abbr. 244413 (Error Correction Code) 
n， 水 平 线 ， 水 平面 ; 水 平 位 置 。 symmetric adj. 对 称 的 ; 匀称 的 
fraction n. 分数 ;部 分 rate-distortion n， 率 失真 
terrestrial adj. 地 球 的 ; 陆地 的 fidelity n. 保 真 度 
n， 陆 地 生物 ; 地 球 上 的 人 catastrophic adj 的 ;悲惨 的 ; 灾难 性 
bitrate n. ERR; 位 率 ; 位 速率 的 ， Ki 
necessitate vt. 使 成 为 必需 ， 需 要 ; 迫使 


xv 
Exercises AN 


I. Answer the n» que: «e 
S in the eit a of a video compres- 


1. What factors sou ie © 


sion system? 
2. What is v hafacteristic? PNG 
3. "AS SCT acis err 


Ts S iate the following sentences into Chinese. 








1. A wide variety of digital video applications currently exist. They range from simple 
low-resolution and low-bandwidth applications ( multimedia. Picturephone) to very high- 
resolution and high-bandwidth (HDTV) demands. 

















2. Asa way to demonstrate the importance of video compression. the transmission of digital 
video television signals is presented. The bandwidth required by a digital television signal is ap- 


proximately one-half the number of picture elements (pixels) displayed per second. 
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3. The nature of video applications makes many demands on the video compression 
system, Interactive video applications such as videoconferencing demand that the video 
compression systems have symmetric capabilities. That is. each participant in the interac- 
tive video session must have the same video encoding and decoding capabilities. and the 


system performance requirements must be met by both the encoder and decoder. 

















Hierarchical Aggrega or di a Overview, Techniques, 


and Design ra XE 


We prese model for building. ein and interacting with multiscale represen- 
tations of information visualization techniques using hierarchical aggregation. The motiva- 
tion for this work is to make visual representations more visually scalable and less clut- 
tered. The model allows for augmenting existing techniques with multiscale functionality, 
as well as for designing new visualization and interaction techniques that conform to this 
new class of visual representations. We give some examples of how to use the model for 
standard information visualization techniques such as scatterplots, parallel coordinates. and 
node-link diagrams. and discuss existing techniques that are based on hierarchical aggrega- 
tion. This yields a set of design guidelines for aggregated visualizations. We also present a 
basic vocabulary of interaction techniques suitable for navigating these multiscale visualiza- 


tions. 
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Improving the Efficiency of Viewpoint Composition 


In this paper. we concentrate on the problem of finding the viewpoint that best satis- 
fies a set of visual composition properties, often referred to as Virtual Camera or View- 
point Composition. Previous approaches in the literature. which are based on general opti- 
mization solvers, are limited in their practical applicability because of unsuitable computa- 
tion times and limited experimental analysis. To bring performances much closer to the 
needs of interactive applications. we introduce novel ways to define visual properties. eval- 
uate their satisfaction, and initialize the search for optimal viewpoints. and test them in 
several problems under various time budgets. quantifying als; e first time in the do- 
main, the importance of tuning the parameters that NS chavior of the solving 
process. While our solver, as others in the literatur: based on Particle Swarm Optimi- 
zation, our contributions could be applied to stoMhastic search process that solves 
through many viewpoint evaluations. e n etic algorithms employed by other pa- 
pers in the literature. The complete so: of our approach. together with the scenes 


and problems we have employed. can fownloaded from https: //bitbucket. org/rra- 
oo 
< 


Assessing the Ability A esi S Simulation to Evaluate Physi- 
cal Risk Factors 


Nowada dh process of workstation deii tends to include assessment steps in a vir- 


tual environment (VE) to evaluate the ergonomic features. These approaches are cost-ef- 
fective and convenient since working directly on the digital mock-up in a VE is preferable to 
constructing a real physical mock-up in a real environment (RE). This study aimed at un- 
derstanding the ability of a VR-based assembly tasks simulator to evaluate physical risk 
factors in ergonomics. Sixteen subjects performed simplified assembly tasks in RE and VE. 
Motion of the upper body and five muscle electromyographic activities were recorded to 
compute normalized and averaged objective indicators of discomfort. that is. rapid upper 
limb assessment score. averaged muscle activations. and total task time. Rated perceived 
exertion (RPE) and a questionnaire were used as subjective indicators of discomfort. The 
timing regime and complexity of the assembly tasks were investigated as within-subject fac- 
tors. The results revealed significant differences between measured indicators in RE and 
VE. While objective measures indicated lower activity and exposure in VE. the subjects 


experienced more discomfort than in RE. Fairly good correlation levels were found between 











RE and VE for six of the objective indicators. This study clearly demonstrates that ergo- 
nomic studies of assembly tasks using VR are still challenging. Indeed. objective and sub- 
jective measurements of discomfort that are usually used in ergonomics to minimize the 
risks of work-related musculoskeletal disorders development exhibit opposite trends in RE 
and VE. Nevertheless, the high level of correlation found during this study indicates that 


the VR-based simulator can be used for such assessments. 
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WIRELESS COMMUNICATION 


Text: Speech ied 


Introduction V 
l. Speech Telephony as mi Multimedia in 


When O. NuBbau 二 人 in the first led transmission of speech and music 
in the experimental, physi eT at Graz Univefsi dl pridie. in 1904, nobody would 
have es the YAmendous growth ine ss multimedia communications 100 years af- 


ter this history chievement. 





Many new media types have emerged such as text. image. and video. and modern 
services range from essentially one-way media download, browsing. messaging. application 
sharing. broadcasting. and real-time streaming to two-way. interactive. real-time conver- 
sations by text (chat). speech, and video telephony. Still, speech telephony is the back- 
bone of all conversational service and continues as an indispensable functionality of any mo- 
bile communications terminal. It is for this reason that we will focus our discussion of 
source-coding methods on speech signals—i. e. . on their efficient digital representation for 
transmission (or storage) applications. 

The success story of digital speech coding started with the introduction of digital 
switching in the Public Switched Telephone Network (PSTN) using Pulse Code Modula- 
ted (PCM)' speech at 64 Kbps and continued with a cascade of advanced compression 
standards from 32 Kbps in the early 1980s over 16 Kbps to 8 Kbps in the late 1990s. all 


with a focus on long-distance circuit multiplication while maintaining the traditional high- 
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quality level of wireline telephony (toll quality). For wireless telephony. the requirements 
on digital coding were rather stringent with regard to bit rate and complexity from the very 
beginning in the mid-1980s whereas some compromises in speech quality seemed acceptable 
because users either had never made the experience of mobile telephony before or. if so. 
their expectations were biased from the relatively poor quality of analog mobile radio sys- 
tems. This situation changed quickly, and the first standards introduced at the beginning 
of the 1990s were completely overturned within 5 years with significant quality enhance- 
ments through new coding algorithms. maintaining a bit rate of about 12 Kbps where the 


accompanying complexity increase was mitigated by related advances in microelectronics 
and algorithm implementation. 
2. Source-coding Basics K 3 
The foundations for source coding were laid ox on [1959]. who developed 
s S: 


ssióri channels but also rate-distortion 





not only channel-coding theory for imperfect tr: 
theory for signal compression. The latter TS. 


(1) a stochastic source model whic! 


ased on two components, 
s to characterize the redundancy in source 
information, 

(2) a distortion measure whi lacterizes the relg¥ahce of source information for a 
user. For asymptotically in Yide" del ay and compl NA certain simple source models 
and distortion measures Xt can be shown that the NC s an achievable lower bound on the 
bit rate necessary chi e a given distortj and, vice versa, that their exists an a- 
chievable lo NS: on the distortion & tolerated for a given bit rate. While complex- 
ity is an Mos dling obstacle. delay is a substantial issue in telephony. as it degrades 


the interactive quality of conversations severely when it exceeds a few 100 ms. Therefore. 





the main insight from rate-distortion theory is the existence of a three-way tradeoff among 
the fundamental parameters rate. distortion. and delay. Traditional telephony networks 
operate in circuit-switched mode where transmission delay is essentially given by the elec- 
tromagnetic propagation time and becomes only noticeable when dealing with satellite 
links. However. packet-switched networks are increasingly being used for telephony as 
well—as in Voice over Internet Protocol ( VoIP) systems—where substantial delays can be 
accumulated in router queues, etc.. In such systems, delay becomes the most essential pa- 
rameter and will determine the achievable rate-distortion tradeoff. 

Source coding with a small but tolerable level of distortion is also known as lossy cod- 
ing whereas the limiting case of zero distortion is known as lossless coding. In most cases. 
a finite rate allows lossless coding only for discrete amplitude signals which we might con- 


sider for transcoding of PCM speech—i. e. . the digital compression of speech signals which 
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have already been digitized with a conventional PCM codec. However. for circuit-switched 
wireless speech telephony. such lossless coders have two drawbacks: first. they waste the 
most precious resource—i. e. . the allocated radio spectrum—as they invest more bits than 
necessary to meet the quality expectations of a typical user; second. they often result in a 


bit stream with a variable rate—e. g. . when using a Huffman coder—which cannot be 





matched efficiently to the fixed rate offered by circuit switched transm 

Variable-rate coding is, however. a highly relevant topic in packet-switched networks 
and in certain applications of joint source channel coding for circuit-switched networks. 
While Shannon's theory shows that. under idealized conditions. source coding and channel 
coding can be fully separated such that the two coding steps can be designed and optimized 
independently. this is not true under practical constraints su ite delay or time-var- 
ying conditions where only a joint design of the source Q nnel coders is optimal. In 
this case, a fixed rate offered by the network can "e ageously split into a variable 
source rate and a variable channel code rate. s$ 

3. Speech Coder Designs = x 

Source-coding theory teaches us © use models of source redundancy and of user- 
defined relevance in the design o h-coding MP ^ cR relevance aspects will 


. ~ 
be discussed later; howeverd use of the source, e 
tion of speech coder desins. 


(D be. a use source models icitly to design an adaptive dynamical sys- 


es rise to a generic classifica- 









tem which ma Noi nal speech wavefói a processed waveform that can be transmitted 
with fewer MN f the given digital channel. The decoder essentially inverts encoder processing 
to restore a faithful approximation of the original waveform. All waveform coders share the prop- 
erty that an increase of the bit rate will asymptotically result in lossless transcoding of the original 
PCM waveform, For such systems. the definition of a coding error signal as the difference be- 
tween the original and the decoded waveform makes sense (although it is no immediate measure of 
the perceptual relevance of the distortion introduced). 

(2) Model-based coders or vocoders rely on an explicit source model to represent the 
speech signal using a small set of parameters which the encoder estimates. quantizes, and 
transmits over the digital channel. The decoder uses the received parameters to control a 
real-time implementation of the source model that generates the decoded speech signal. An 
increase of the bit rate will result in saturation of the speech quality at a nonzero distortion 
level which is limited by systematic errors in the source model. Only recently. model-based 
coders have advanced to a level where these errors have little perceptual impact. allowing 


their use for very-low-rate applications (2. 4 Kbps and below) with slightly reduced quality 
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constraints, Furthermore. due to the signal generation process in the decoder. the decoded 





waveform is not synchronized with the original waveform and. therefore. the definition of a 
waveform error is useless to characterize the distortion of model-based coders. 

(3) Hybrid coders aim at the optimal mix of the two previous designs. They start out 
with a model based approach to extract speech signal parameters but still compute the mod- 
eling error explicitly on the waveform level. This model error or residual waveform is 
transmitted using a waveform coder whereas the model parameters are quantized and trans- 
mitted as side information. The two information streams are combined in the decoder to re- 
construct a faithful approximation of the waveform such that hybrid coders share the 
asymptotically lossless coding property with waveform coders. Their advantage lies in the 
explicit parameterization of the speech model which allows ,us ploit more advanced 
models than is the case with pure waveform coders which a single invertible dynam- 


ical system for their design. 





The model-based view of speech-coding desi sts that description of a speech- 


coding system should always start with SE 
le 


tion of the underlying speech model. Tj 


sut 






hat typically contains an implementa- 
r is then obtained as the signal analysis 


system that extracts the relevant m ameters and residual waveform. Therefore. the 






encoder has a higher complexi ðu he decoder and i re difficult to understand and 


implement. In this sense. azspéech-coding WM. ecify only the decoder and the 


format for transmitted reams while leaving t 


to industrial NA UN 
we 


Further NN issues 


esign of the best matching encoder 


The reliance on source models for speech coder design naturally results in a depend- 
ence of coder performance on the match between this model and the signal to be encoded. 
Any signal that is not clean speech produced from a single talker near the microphone may 
suffer from additional distortion which requires additional performance testing and, possi- 
bly, design modifications. Examples of these extra issues are the suitability of a coder for 
music (e. g. . if put on hold while waiting for a specific party). for severe acoustic back- 
ground noise (e. g. . talking from the car. maybe with open windows). for babble noise 
Írom other speakers (e. g. . talking from a cafeteria). for reverberation (e. g. . talking in 
hands-free mode), etc. . 

Further system design aspects include the choice between narrowband speech as used 
in traditional wireline telephony (e. g. . an analog bandwidth from 300 Hz to 3. 4 kHz with 
8-kHz sampling frequency) and wideband speech with quality similar to Frequency Modula- 
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tion (FM) radio (e. g. . an analog bandwidth from 50Hz to 7 KHz with 16-KHz sampling 
frequency), which substantially enhances user experience with clearly noticeable speech 
quality improvements over and above the Plain Old Telephone Service (POTS). Second. e- 
ven with the use of sophisticated channel codes. the robustness of channel errors such as 
individual bit errors. bursts. or entire lost transmission frames or packets is also a source- 
coding design issue. 

Finally. within the network path from the talker to the listener. there may be several 
codec-tandeming steps where transcoding from one coding standard to another occurs. each 


time with a potential further loss of speech quality. 


New Words and Phrases «e 
browse vt. & vi. DUS; 随意 翻阅 d fed. foh. Bue 
ü 


n. WA c .| 网 码 译 码 器 
telephony n. 拨号 服务 ; 电话 语音 A evt. A. 4 HK; Weee 拨 给 


backbone ». 分水岭; 支柱 Q^ ectrum n. 谱 ， 光 谱 


indispensable adj. AN FY HR AY split vt. DA; 分 开 


n. ANB RD AY n. Xon 
统 的 ) 功 的 
ence ua 


functionality n. (计算 机 或 ü- 
adj， 知 觉 的 ， 有 知觉 的 ; 感性 


能 ; 功能 性 X- 
cascade n. an 



















3 x Aft vit， 使 …… 前 后 倒 署 ; 使 反 转 
toll o. fj OE A approximation n. 近似 值 ; 近似 
vi. WESS j vocoder n， 声 码 器 ; 自动 语音 合成 仪 
n， 税 ,通行 税 ; 通行 费 waveform n. JÉ 
stringent adj， 严 格 的 ; 迫切 的 hybrid n， 杂 种 ; 混合 物 ; 混合 词 
compromise n， 受 协 adj. 混合 的 ; 杂种 的 
vi。 折 中 解决 ; 妥协 residual adj. 残余 的 ; 残留 的 
zt。 违背 (原则 ) n. FIAT; 残渣 
bias». 偏差 ;背离 率 指标 modification n. i, 修正， 变更 
analog n. 模拟; 相似 体 acoustic adj。 听 觉 的 ; 声学 的 
acj， 模 拟 的 babble vi. MEMEAN 
overtune n. 序曲 vt， 含 糊 不 清 地 说 ; 泄露 
enhancement n， 增 益 n. HAR ALIA 
asymptotically adv. 渐 近 地 reverberation n， 混 响 ; 残 响 ; 回响 ; 反射 
dwindling adj. 逐渐 减少 的 hands-free mode 免 提 模 式 


trade off 权衡 narrowband n. ^Efr 











modulation n. mi; 调幅 度 


Notes 

















l. 公用 电话 交换 网 (PSTN，Public Switch Telephone Network). HI E A ÆN rp ai 
的 电话 网 。 众 所 周知 ，PSTN 是 一 种 以 模拟 技术 为 基础 的 电路 交换 网 络 。 在 众多 的 广 域 
互联 技术 中 ,通过 PSTN 进行 互 连 所 要 求 的 通信 费用 最 低 ， 但 其 数据 传输 质量 及 传输 速度 
也 最 差 ， 同 时 PSTN 的 网 络 资源 利用 率 也 比较 低 。 
2， 脉 冲 编码 调制 (Pulse Code Modulation) ， 简 称 PCM。 是 对 连续 变化 的 模拟 信号 
行 抽样 、 量 化 和 编码 产生 的 数字 信号 。PCM 的 优点 是 音质 好 ， 缺 点 是 体积 大 。PCM 可 以 
提供 用 户 从 2 Mbps 到 155 Mbps 速率 的 数字 数据 an gunar 图 像 传 送 、 

















= 





























远程 教学 等 其 他 业务 。 


Exercises 


I. Please translate the tores and phrases into Chinese. 
l. source-coding method XP 

2. analog mobile adig Aen et 
3. rate-distortio; dM 

4. cern stacle X 

5. pack RXvigched network Re 

6. VoIP stem 

7. Variable-rate coding 

8. circuit-switched network 

9 


. hybrid coder 
10. Frequency Modulation 


I. Fill in the blanks with the missing word(s) from the table below. 


























components mechanical encoded dynamic 

performance vocal converts remarkable 
audio independent relevant frequency 

distortion representation evaluation collapsed 
filtering perception imprecise modifications 
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l. The ultimate recipient of human speech is the human hearing system. a 








receiver with two broadband. directional antennas shaped for spatiotemporal (the 

outer ear) in terms of the individual. monaural Head Related Transfer Functions, HRTFs 

(functions of both azimuth angle and ) which along with interaural delay 
give rise to our spatial hearing ability. 

2. A highly adaptive. impedance-matching network (the middle ear) covers 

a range of more than 100 dB and a 3000-channel, phase-locking, threshold-based 


receiver (hair cells in the inner ear's cochlea) with very low self-noise (just above the level 





where we could hear our own blood-flow-induced noise) — — the signal to an extreme- 

ly parallel, low rate, synchronized — useful for distributed processing with low- 
power and circuits Cour nervous system). 

3. Auditory models in the form of psychoacous behavioral-models of 

are very popularin — — coding (like t Ye the einu MP3 standard) 

because they allow us to separate — — — from sees of the information, For in- 

stance, certain signal — — — may be mas! thers to such an extent that they be- 
come totally inaudible. a 

4. In particular. all the naturals Sources (which can be located at many positions 

along the vocal tract and which ten controlled toe ae aerodynamic flow) are 

into a single sour which drives the filt way. Furthermore, 


there is only a single a lea the om y ction system may switch between or 

even combine defe nasal branches oh tract. 
5. The ^ in source models £85 ch coder design naturally results in a de- 
pendence of N on the match between this model and the signal to be 
. Any signal that is not clean speech produced from a single talker near the micro- 


phone may suffer from additional which requires additional performance testing 


and. possibly. design 
II. Translate the following sentences into Chinese. 


1. While the “sound of music" includes a wide range of signal generation mechanisms 
as provided by an orchestra of musical instruments. the instrument for generating speech is 
fairly unique and constitutes the physical basis for speech modeling. even at the acoustic or 


perception levels. 
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2. Ina nutshell, speech communication consists of information exchange using a natu- 








ral language as its code and the human voice as its carrier. Voice is generated by an intri- 


cate oscillator—the vocal folds—which is excited by sound pressure from the lungs. 

















3. The proof of a speech coder lies in listening. Till today. tbe best way of evaluating 


the quality of a speech coder is by controlled listening tests d with sizable groups 


and Requir ts of Wireless Services 


of listeners (a couple of dozens or more). 























Reading: Appli 





Z% X 
Wireles i S is one oC engineering success stories of the last 25 
years—not Nou a scientific point of view, where the progress has been phenomenal 
but also in terms of market size and impact on society. Companies that were completely un- 
known 25 years ago are now household names all over the world, due to their wireless 
products, and in several countries the wireless industry is dominating the whole economy. 
Working habits, and even more generally the ways we all communicate, have been changed 
by the possibility of talking “anywhere, anytime”. 

For a long time. wireless communications has been associated with cellular telephony. 
as this is the biggest market segment, and has had the highest impact on everyday lives. In 
recent times. wireless computer networks have also led to a significant change in working 
habits and mobility of workers—answering emails in a coffee shop has become an everyday 
occurrence. But besides these widely publicized cases. a large number of less obvious appli- 
cations have been developed. and are starting to change our lives. Wireless sensor net- 
works monitor factories. wireless links replace the cables between computers and key- 


boards. and wireless positioning systems monitor the location of trucks that have goods i- 
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dentified by wireless Radio Frequency (RF) tags. This variety of new applications causes 
the technical challenges for the wireless engineers to become bigger with each day. This 
book aims to give an overview of the solution methods for current as well as future challen- 
ges. 

Quite generally, there are two paths to developing new technical solutions: engineer- 
ing driven and market driven. In the first case. the engineers come up with a brilliant sci- 
entific idea—without having an immediate application in mind. As time progresses. the 
market finds applications enabled by this idea. In the other approach. the market demands 
a specific product and the engineers try to develop a technical solution that fulfills this de- 
mand. In this paper. we describe these market demands. We start-out with a brief history 
of wireless communications. in order to convey a feeling of h science. as well as the 


market, has developed in the past 100 years. Then xe 
e 


services that constitute the majority of the wireless t\today. Each of these services 





escription of the types of 


makes specific demands in terms of data rate. xev of users. energy consump- 


tion, mobility, and so on, EN, 
History "i 
1. How It All Stai et 


When Nn the istory of comm 





+ we find that wireless communications 





is actually t x^ nnda and j rums did not require any wires or cables to 


function. Evene E "electromagnetic" (optical) communications are wireless; smoke 





signals are based on propagation of optical signals along a line-of-sight connection. Howev- 
er, wireless communications as we know it started only with the work of Maxwell and 
Hertz. who laid the basis for our understanding of the transmission of electromagnetic 
waves. It was not long after their groundbreaking work that Tesla demonstrated the trans- 
mission of information via these waves—in essence. the first wireless communications sys- 
tem. In 1898. Marconi made his well-publicized demonstration of wireless communications 
from a boat to the Isle of Wight in the English Channel. It is noteworthy that while Tesla 
was the first to succeed in this important endeavor. Marconi had the better public rela- 
tions, and is widely cited as the inventor of wireless communications. receiving a Nobel 
prize in 1909, 

In the subsequent years. the use of radio (and later television) became widespread 
throughout the world. While in the “normal” language. we usually do not think of radio or 


TV as "wireless communications". they certainly are. in a scientific sense. information 
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transmission from one place to another by means of electromagnetic waves. They can even 
constitute "mobile communications". as evidenced by car radios. A lot of basic resear- 
ches—especially concerning wireless propagation channels—were done for entertainment 
broadcasting. By the late 1930s. a wide network of wireless information transmission— 


though unidirectional—was in place. 
2. The First Systems 


At the same time, the need for bidirectional mobile communications emerged. Police 
departments and the military had obvious applications for such two-way communications, 
and were the first to use wireless systems with closed user groups. Military applications 
drove a lot of the research during. and shortly after. the Sec: d War. This was also 
the time when much of the theoretical foundations for comi isations in general were laid. 


Claude Shannon's groundbreaking work A Mathem: ory of Communications ap- 





peared during that time, and established the possi error-free transmission under re- 


strictions for the data rate and the Signal-to-Npi atio (SNR). Some of the suggestions 


in that work, like the use of optimum pge: cation in frequency-selective channels. are 








only now being introduced into wireless 

The 1940s and 1950s saw, s iocis cpm the use of Citizens’ Band 
(CB) radios became widespit establishing a ne mmunicating between cars on 
the road. cee these systems wa ko e transferring vital traffic infor- 
mation and ec sd within the close mity of the drivers owning such devices. 
but it lacked terface to the public die system. and the range was limited to some 
100 km. Ak 


telephone system was installed in the U. S. A. (St. Louis). This system did have an inter- 


face to the Public Switched Telephone Network (PSTN), the landline phone system, 


g on the power of the (mobile) transmitters. In 1946, the first mobile 


though this interface was not automated, but rather consisted of human telephone opera- 
tors. However, with a total of six speech channels for the whole city, the system soon met 
its limits. This motivated investigations of how the number of users could be increased, e- 
ven though the allocated spectrum would remain limited. Researchers at AT&-T’s Bell 
Labs found the answer: the cellular principle, where the geographical area is divided into 
cells; different cells might use the same frequencies. To this day. this principle forms the 
basis for the majority of wireless communications. 

Despite the theoretical breakthrough. cellular telephony did not experience significant 
growth during the 1960s. However. there were exciting developments on a different front: 
in 1957. the Soviet Union launched the first satellite (Sputnik ) and the U. S. A. soon fol- 


lowed. This development fostered research in the new area of satellite communications. 








Many basic questions had to be solved. including the effects of propagation through the at- 
mosphere, the impact of solar storms". the design of solar panels and other long-lasting 
energy sources for the satellites. and so on. 

To this day. satellite communications is an important area of wireless communica- 


tions. The most widespread application lies in satellite TV transmission. 


3. Analog Cellular Systems 


The 1970s saw a revived interest in cellular communications. In scientific research. 


and other quantities that determine the performance of analo ne systems, A high- 







these years saw the formulation of models for path loss. Doppler Ben fading statistics. 
l 
light of that work was Jakes’ book Microwave Mobile R: that summed up the state of 


the art in this area. The 1960s and 1970s also saw a | basic research that was originally 








intended for landline communications, but later a6 ed to be instrumental for wireless 


communications. For example. the basics « ive equalizers, as well as multicarrier 
communications. were developed durin 

For the practical use of wirelesst ny. the progress in device miniaturization made 
the vision of "portable" devices alistic. Cue Motorola and AT&T vied 
for leadership in this area a Adde vital contributi pon Telephone and Telegraph 
(NTT) established a c ercial cellphone sys wo in 1979. However, it was a 
Swedish company (b It up the first s: NA large coverage and automated switc- 
hing: up to tha/pori®, Ericsson AB had mostly known for telephone switches while 
radio ka was of limited interest to them, However, it was just that expertise 
in switching technology and the (for that time. daring) decision to use digital switching 
technology that allowed them to combine different cells in a large area into a single net- 
work. and establish the Nordic Mobile Telephone (NMT) system. Note that while the 
switching technology was digital. the radio transmission technology was still analog. and 
the systems became therefore known as analog systems. Subsequently. other countries de- 
veloped their own analog phone standards. The system in the U.S. A. . e. g. . was called 
Advanced Mobile Phone System (AMPS). 

An investigation of NMT also established an interesting method for estimating market 
size: business consultants equated the possible number of mobile phone users with the 
number of Mercedes 600 (the top-of-the-line luxury car at that time) in Sweden. Obvious- 
ly. mobile telephony could never become a mass market. could it? Similar thoughts must 
have occurred to the management of the inventor of cellular telephony, AT&T. Upon ad- 


vice from a consulting company. they decided that mobile telephony could never attract a 
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significant number of participants and stopped business activities in cellular communica- 





tions. 

The analog systems paved the way for the wireless revolution. During the 1980s. they 
grew at a frenetic pace and reached market penetrations of up to 10% in Europe. though 
their impact was somewhat less in the U. S. A.. In the beginning of the 1980s. the phones 
were "portable", but definitely not handheld. In most languages. they were just called 
“carphones”, because the battery and transmitter were stored in the trunk of the car and 
were too heavy to be carried around. But at the end of the 1980s. handheld phones with 
good speech quality and quite acceptable battery lifetime abounded. The quality had be- 


there just did not seem to be a need for further improvements. 


come so good that in some markets digital phones had difficulty e ablishing themselves— 


4. GSM and the Worldwide Cellular Revolution 
Even though the public did not see a need f. «^ analog to digital. the net- 
work operators knew better. Analog MS ad spectral efficiency. and due to the 


rapid growth of the cellular market. opefatofs Rad a high interest in making room for more 


customers, Also. research in commuwieafions had started its inexorable turn to digital 
communications. and that inclu: ital wireless A: a as well. In the late 


1970s and the 1980s, resea to spectrally effici netulation formats. the impact of 
channel distortions. 34 vin variations o AMAN as well as multiple access 
schemes and macie were explored inyresc: labs throughout the world. It thus be- 
came clear t cognoscenti that the ré; ld systems would soon follow the research. 

Again. LS p that led the way. The European Telecommunications Standards 
Institute (ETSD group started the development of a digital cellular standard that would 
become mandatory throughout Europe and was later adopted in most parts of the world: 
Global System for Mobile communications (GSM). The system was developed throughout 
the 1980s; deployment started in the early 1990s and user acceptance was swift. Due to ad- 
ditional features. better speech quality. and the possibility for secure communications. 
GSM based services overtook analog services typically within 2 years of their introduction. 
In the U.S. A. the change to digital systems was somewhat slower. but by the end of the 
1990s, this country also was overwhelmingly digital. 

Digital phones turned cellular communications. which was already on the road to suc- 


cess. into a blockbuster. By the year 2000. market penetration in Western Europe and Ja- 





pan had exceeded 50%. and though the U.S. A.. showed a somewhat delayed develop- 
ment. growth rates were spectacular as well. 


The development of wireless systems also made clear the necessity of standards. De- 
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vices can only communicate if they are compatible. and each receiver can “understand” each 
transmitter—i. e. . if they follow the same standard. But how should these standards be 
set? Different countries developed different approaches. The approach in the U.S. A. is 
“hands-off”: allow a wide variety of standards and let the market establish the winner (or 
several winners). When frequencies for digital cellular communications were auctioned off 
in the 1990s, the buyers of the spectrum licences could choose the system standard they 
would use. For this reason. three different standards are now being used in the U. S. A.. 
A similar approach was used by Japan. where two different systems fought for the market 
of Second Generation (2G) cellular systems. In both Japan and the U. S. A. . the networks 
based on different standards work in the same geographical A. allowing consumers to 
choose between different technical standards. 

The situation was different in Europe. When digital uem nications were introduced, 
usually only one operator per country (typically. tl ent publie telephone opera- 
tors) existed. If each of these operators would a X standard. the result would 
be high market fragmentation (i. e. ，a phas or each standard), without the bene- 
fit of competition between operators. 


Furthermore, roaming from c e. which for obvious geographical re- 
gions is much more frequent i an in the U. S. ipee would be impossible. 
It was thus logical to ce common sta ll of Europe. 

New Words and pay X V 


W Ke 
occurrence X 出 现 ; 遭遇 ， 事 件 bidirectional adj. XUI ÁS 








publicize vt. (fs 发 表 ; 为 …… 做 广告 optimum adj. 最 适宜 的 
jungle drum 丛林 鼓 n. 最 住 效果 ; 最 适宜 条 件 
cable n. Hiii. 电费 panel n. fiit; 面 ; HAs 控制 板 
vt， 发 电报 至 ; 电 传 wi， 把 …… 分 格 
vi. jak doppler spectra 多 普 勒 谱 
electromagnetic adj. 电磁 的 instrumental adj. RARAJ; 仪器 的 ; 有 帮助 
propagation 2， 传播 ， 传输 ， 草 延 ， 扩 展 ， 的 ; 起 作用 的 
波及 深度 equalizer n. 均衡 器 ,平衡 装置 ; 补偿 器 


groundbreaking adj. 开创 性 的 ， 突破 性 的 。 miniaturization n， 小 型 化 
noteworthy adj. 值得 注意 的 ， 显 著 的 , 3E coverage n， 范 围 ， 规 模 




















要 的 abound vi. £®. 盛产 
endeavor vt. & vi. iR. 试图; 尽力, 竭力 inexorable adj. 不 能 变更 的 ; 不 可 阻挡 的 
n. 努力 ， 尽力 spectrally adv. 项 灵 似 地 ， 可 怕 地 











mandatory adj. 强制 的 ; 命令 的 ; 受 委托 的 ”spectacular adj. 惊人 的 








n. 受托 者 n. 壮观 的 场面 ， 精 彩 的 表演 
blockbuster n， 重 磅 炸弹 ， 了 不 起 的 人 或 ”auction n， 拍 卖 ; 竞 卖 ; 标 售 
事 ; 大 片 ; 风靡 一 时 的 事物 ut. HR: ER 
penetration n. i238; JA fragmentation n. 4924. DERE 


Notes 


1. solar storms; 太阳 风暴 ， 指 太阳 在 黑子 活动 高 峰 阶 段 产 
风暴 随 太阳 黑子 活动 周期 每 11 年 发 生 一 次 ， 是 一 种 太阳 自 
都 会 有 峰 年 。 ARPER AN PORA E ERT ITERA 
R, 破坏 臭氧 层 ， 十 扰 无 线 通 信和 ， iio Se 


Exercises SU 
I. Answer the following M 


hications been ass th cellular telephony? 


的 剧烈 爆发 活动 。 太 阳 
性 变化 ， 每 个 周期 内 
流 ， 严 重 影响 地 球 的 空 













2. What are the twiXpaths to developing nical solutions? 
3. What are equirements of wires Sele 
— 


í A. 
I. Tradex le following sentences “intd Chinese. 


1. Fora long time, wireless communications has been associated with cellular teleph- 
ony, as this is the biggest market segment. and has had the highest impact on everyday 
lives. In recent times. wireless computer networks have also led to a significant change in 
working habits and mobility of workers—answering emails in a coffee shop has become an 


everyday occurrence. 

















2. Quite generally. there are two paths to developing new technical solutions: engineering 
driven and market driven. In the first case. the engineers come up with a brilliant scientific idea— 


without having an immediate application in mind. As time progresses. the market finds applica- 








= RARE d | 


tions enabled by this idea. In the other approach. the market demands a specific product and the 





engineers try to develop a technical solution that fulfills this demand. 

















3. Even though the public did not see a need for changing from analog to digital. the 


network operators knew better. Analog phones have a bad spectral efficiency. and due to 


more customers. Also. research in communications had start. xorable turn to digit- 


the rapid growth of the cellular market. operators had a high d in making room for 
al communications. and that included digital wireless co tions as well, 


Xr 
S 


?K 




















4. Digital phones turned ,ce| ómmunications. was already on the road to 
success, into a Mee e year 2000. ma ration in Western Europe and 
Japan had exceeded 507 though the U. owed a somewhat delayed develop- 
ment. growth ne ere Á eet as XE MONS 





so ia 














Abstract Reading 


Device-to-device Communications with Wi-Fi Direct: Overview and Experimentation 


Wi-Fi Direct is a new technology defined by the Wi-Fi Alliance aimed at enhancing di- 
rect device to device communications in Wi-Fi. Thus. given the wide base of devices with 


Wi-Fi capabilities. and the fact that it can be entirely implemented in software over tradi- 
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tional Wi-Fi radios. this technology is expected to have a significant impact. In this article 





we provide a thorough overview of the novel functionalities defined in Wi-Fi Direct. and 
present an experimental evaluation that portrays the performance to be expected in real sce- 
narios. In particular, our results quantify the delays to be expected in practice when Wi-Fi 
Direct devices discover each other and establish a connection, and the performance of its 
novel power saving protocols. To the best of the authors' knowledge this is the first article 
in the state of the art that provides a wide overview and experimental evaluation of Wi-Fi 


Direct. 
Fractional Frequency Reuse for Interference Management in LTE-advanced Hetnets 


Improvement of cell coverage and network capacity en challenges for the e- 
aL 


volving 4G cellular wireless communication aeo TE-Advanced networks. In 
this context, hierarchical layering of cells with: ro base stations coexisting with low- 
power and shortrange base stations (correspdpdi o picocells or femtocells) in a service 


area is considered to be an efficient soluti inc enhance the spectral efficiency of the net- 






work per unit area. Also. such a hierazkliical cell deployment. which is referred to as a 


heterogeneous network. or Het-) rovides Ré. ANN in the coverage of 
indoor and cell edge users ai €nisures better QoS > gers. Interference mitigation be- 
tween different layers A Pd the key issues E to be resolved for successful de- 
ployment of HAT this end. fast frgqueady’ response. FFR, is considered to be an 
efficient inte intPrference coordinatidhy ique for OFDMA-based HetNets. This ar- 
ticle REC in three state-of-the-art FFR deployment schemes; strict FFR, 
soft FFR. and FFR-3 schemes for OFDMA-based two-tier HetNets comprising macrocells 
overlaid with femtocells. Also, a variation of the FFR-3 scheme, which is referred to as 
the optimal static FFR COSFFR) scheme. is proposed. A broad comparison among all 
these FFR schemes is performed by using Monte Carlo simulations considering performance 
metrics such as outage probability. average network sum rate. and spectral efficiency. 
Simulation results show that. the average gains in spectral efficiency (bps/Hz) of the net- 
work are significantly higher for the proposed scheme when compared to the strict FFR. 
soft FFR. and FFR-3 schemes. 


Integration of Wearable Devices in a Wireless Sensor Network for an E-health Ap- 
plication 


Applications based on Wireless Sensor Networks for Internet of Things scenarios are 








on the rise. The multiple possibilities they offer have spread towards previously hard to i- 
magine fields. like e-health or human physiological monitoring. An application has been 
developed for its usage in scenarios where data collection is applied to smart spaces. aiming 
at its usage in fire fighting and sports. This application has been tested in a gymnasium 
with real. non-simulated nodes and devices. A Graphic User Interface has been implemen- 
ted to suggest a series of exercises to improve a sportsman/woman's condition, depending 
on the context and their profile. This system can be adapted to a wide variety of e-health 
applications with minimum changes. and the user will interact using different devices, like 


smart phones. smart watches and/or tablets. 
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Text; Integrated Optics an tonics 


Introduction 二 XS 
This paper contains the dees h active and passive optical components and 
devices which are utilized within IMiber communicapiggs to enable both signal manip- 


ulation and processing we es the impleme high-performance optical fi- 
ber communication system arid*hetworks. The inia 











n of such components and devices 






is also an importany/fnd ihtegral aspect to give iple optical processing functions which 
s : ~ 

are carried out, witli$üy recourse to convi ack to the electrical domain. Hence the de- 

velopment oft ated optics (IO) for the realization of optical and electro-optical ele- 


ments integrated together and. more recently, integrated photonics (IP) ^' in which device 
integration in large numbers can be provided on a single substrate are crucial to the further 
development of the optical fiber telecommunications network. 

In particular. IP refers to the fabrication and integration of several or many compo- 
nents onto a single planar substrate. Such components include beam splitters. couplers. 
gratings. polarization controllers. interferometers. sources. detectors and optical ampli- 
fiers. These components when integrated with planar waveguides constitute the basic 
building blocks to fabricate more complex planar devices that perform not only optical sig- 
nal guiding and coupling but also controlling functions such as switching. splitting. multi- 


plexing and demultiplexing of optical signals. 
Integrated Optics and Photonics Technologies 


Integrated technology for optical devices has developed within optical fiber communi- 
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cations so that it is now possible to fabricate a complete system onto a single chip. Integra- 
tion for such devices has become a confluence of several optical or photonic disciplines. 
Both IO and IP technologies are referred to in the above where the control of the optical de- 
vices distinguishes one technology from the other. Electronic control of the optical devices 
determines the terminology of IO whereas photons control the operation of IP devices. In 
addition, IP does not involve any optoelectronic conversion of optical signals and hence this 
technology is also termed as “all-optical”. Both IO and IP use planar waveguide technology 
to provide interconnections between optical components including the basic components for 
guiding and control of optical signals. IP technology. however. enables the fabrication of 
subsystems and systems to be realized onto a single substrate. Hence when both active and 


passive devices are monolithically integrated onto a single subs a multilayered inte- 






gration then these devices are normally referred to as IP s. while when both active 


and passive elements fabricated as individual devices Xs onnected together they form 
larger IO devices or circuits. Thus IP can also be<séen!&s a process for the miniaturization 
and integration of optical systems on a single subStrate, and therefore it may be considered 
as a further enhancement of IO technoldgy. necessarily as an alternative technology. 


Both IO and IP seek to provide an € to the conversion of an optical signal back in- 


ighab processing by ds such processing to be per- 

ce thin transpar' ric layers on planar substrates 

ides are used in IO NK produce small-scale and miniature 

optical wm cifcuits. The birthsaf I ay be traced back to basic ideas outlined 
6 








to the electrical regime prior to 8 
formed on the optical signal? 
which act as optical wa 
AE 

by Anderson4h 19: He suggested that/&@itrofabrication technology could be developed 
for pee tical devices with semiconductor and dielectric materials in a similar man- 
ner to that which had taken place with electronic circuits. It was in 1969, however. after 
Miller had introduced the term integrated optics when discussing the long-term outlook in 
the area. that research began to gain momentum, 

Developments in IO have passed the stage where both signal processing and logic func- 
tions can be physically realized. Furthermore. such devices may form the building blocks 
for future digital optical computers. Nevertheless. a number of these advances combine to 
be closely linked with developments in lightwave communication employing optical fibers. 

A major factor in the development of IO is that it is essentially based on single-mode 
optical waveguides and therefore tends to be incompatible with multimode fiber systems. 
Hence IO did not make a significant contribution to early deployed optical fiber systems. 
The advent. however. of single-mode transmission technology further stimulated work in 
IO in order to provide devices and circuits for these more advanced third-generation sys- 


tems. Furthermore. the continued expansion of single-mode optical fiber communications 
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has created a growing market for such IO components. It is also likely that new generations 
of optical fiber communication systems employing coherent and possibly soliton transmis- 
sion will lean heavily on IO and IP techniques for their implementation. 

The proposals for IO and IP devices and circuits which in many cases involve reinven- 
tions of electronic devices and circuits exhibit major advantages other than solely a compati- 
bility with optical fiber communications, Electronic circuits have a practical limitation on 
speed of operation at a frequency of around 1 010 Hz resulting from their use of metallic 
conductors to transport electronic charges and build up signals. The large transmission 
bandwidths Cover 100 GHz) currently under investigation for optical fiber communications 


are already causing difficulties for electronic signal processing within the terminal equip- 


ment. 
The use of light with its property as an aeri of extremely high fre- 
quency (1 014 to 1 015 Hz) offers the possibility of highis, 


faster than that conceivable employing electroni ul Interaction of light with materi- 
als such as semiconductors or transparent S : 


operation around 104 times 


curs at speeds in the range of 10 - 12 
(pico) to approaching 10 - 15 (femto) sécontfs (hus providing a basis for subpicosecond^ 
optical switching. 


The other major attribut 
- 








RUN by optical sig eracting within a responsive 
medium is the ability to uti i&htwaves of differe; e@encies Cor wavelengths) within 
eJ or device. Such frequ 


the same guided wave c y division multiplexing allows an in- 


formation M sd far superior t g offered by electronics. Moreover, in 
signal em S s it facilitates paral ecc to information points within an optical 
system, T£ N ibility for powerful parallel signal processing coupled with ultrahigh 
speed operation offers tremendous potential for applications within both communications 
and computing. 

The devices of interest in IO and IP are often the counterparts of microwave or bulk 
optical devices. These include junctions and directional couplers, switches and modulators, 
filters and wavelength multiplexers, lasers and amplifiers, detectors and bistable elements. 
It is envisaged that developments in this technology will provide the basis for the next gen- 
eration of optical networks where full monolithic integration will be used. 

The technology associated with the design and fabrication of IP circuits and devices de- 
pends upon different factors that mainly result from the characteristics of the substrate ma- 
terial on which the various devices are to be fabricated. The IP process may require serial, 
parallel or hybrid integration of independent devices. In serial integration of devices, dif- 
ferent elements of the optical chip can be interconnected in a consecutive manner and there- 


fore side, or edge-emitting. or conducting optical devices can be readily integrated on the 
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same substrate. In the parallel case the chip is constructed by developing columns of de- 
vices in which surface- or bottom-emitting devices can effectively be used whereas in hybrid 
integration IP technology the devices are fabricated using both serial and parallel integra- 
tion on the same substrate. To gain control of the optical signals. however. additional ele- 
ments can be developed separately or be directly attached to the IP circuit. or be intercon- 
nected to it by optical fibers. In addition. both active and passive devices may be required 
to be located on the same substrate and therefore hybrid IP integration demands multilay- 
ered IP circuits and components to be produced on a single substrate such that they must be 
compatible with three-dimensional structures of other IO or IP devices. 


The enabling technologies for IP mainly rely on silica-on-silicon (SOS) where the 





waveguide structures comprise three layers. namely the buff. ore and the cladding. 


Due to its refractive index match to silica-based optical S rtical light confinement in 
SOS can be achieved by increasing the refractive nexo ore layer relative to the sur- 
rounding glass. whereas lateral confinement is. i eis structuring of the core 
layer. The real benefit of SOS. however. i AND y to apply wafer-scale. planar lithog- 
raphy and processing techniques to inte; K vid numbers of functions either as ar- 
rays of identical devices or in the Customized circuit configurations on single or 
multiple chips. This integration ity offers an Si atom for the implementa- 


tion of typical fiber-based f nalities such as o er splitters or combiners, cou- 
plers, wavelength-selec ouplers. multipl multiplexers and optical gain ele- 
ments, Furtherm OP cal switches ani sori attenuators based on the thermo- 


optic effect ENS. Iso E fabricated. In a * some devices can be fabricated using a sili- 
com es -) gel process which cannot be implemented using SOS techniques. The 
SOL gel process is a versatile solution-based technique for making ceramic and glass mate- 
rials which involves the transition of a system from a liquid (or solution) into a gel. Apply- 
ing the SOL gel and SOS techniques. it is possible to fabricate thin-film coatings. ceramic 
fibers and waveguide based optical amplifiers. Further to the above integration technolo- 
gies used for IP devices. a silicon-on-insulator (SOD approach is also used to produce mi- 
crowave guide bends and couplers at a reduced scale while maintaining compatibility with 
the standard silicon fabrication techniques. SOI based on CMOS technology that has revo- 
lutionized both electronic and optoelectronic integrated circuit technologies is now usefully 
applied as an IP technology. Using this integration technique. active components like opti- 
cal sources. detectors and amplifiers can be coupled to other IP devices. Furthermore. the 
more recently developed photonic crystal waveguide technology is also compatible with this 


integration technology. 
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New Words and Phrases 


amplification n. 放大; 放大 率 bulk n. (大 ) 体 积 

facilitate vt， 促 进 ， 助 长 ; 使 容易 ; 帮助 adj. 大 批 的 ， 大 量 的 

beam splitter ARE filler n. WIERE 

coupler n. HAR Etki multiplexer ?2。 多 路 调制 器 

grating n. JGA: 格 机 bistable adj. ASH) 

polarization controller 偏振 控制 器 envisage vt， 想 象 ， 设 想 ; 正视 ， 面 对 
interferometer n， 十 涉 仪 monolithic n. HJ > 整体 式 
amplifier n， 放 大 器 serial n. ARO; de. 串 行 
fabricate vt， 制 造 ; 装配 ; 捏造 adj. 人 

demultiplexing n， 多 路 解 编 comet j. 连续 的 ， 连 贯 的 
photonics n， 光 子 学 ei . Meis 发 射 ; 颁布 
confluence n. 合流; 融合 2 ted adjJ， 编 造 的 

terminology n. R, RHE WV ut. Fai fabricate 的 过 去 分 词 ) 


optoelectronic adj， 光 电子 的 buffer n. upik; 缓冲 区 


subsystem n. ARB. AE SS Mos 
monolithically adv. "f Fr VE wees Js 电镀 ; fast; Motil 


multilayered adj. ZE achive adj. 折射 的 

microfabrication n LA IL T. sad n. REG. 二 氧化 硅 

semiconduct 体 KZ vertical adj. 垂直 的 ， 坚 立 的 
ad. 将 临 期 


advent n. iH confinement n. 监禁 ,关押 ; 分 娩 


soliton n. IMF; MTF lithography n. JGA); 平版 印刷 
reinvention n. 再 创造 、 重 塑 customize vt. 定制 
solely adv. WE—3&; 仅仅 attenuator n. 训 减 器 
compatibility n. 适合 ; 互 换 性 ; 通用 性 ceramic adj. Pig gt 
responsive adj. 响应 的 ， 有 求 必 应; 应 7 陶瓷 制品 ;陶瓷 器 

叫 性 bend wt. & wi。 使 弯曲 ; 弯 管 ;弯曲 变形 ;， Sj 
ultrahigh adj.， 超 高 的 ， 特 高 的 compatibility n. A; 互 换 性 ; 通用 性 
Notes 


l. integrated optics (IO); 集成 光学 。 集 成 光学 是 研究 媒质 薄膜 中 的 光学 现象 以 及 光 
学 元 件 集成 化 的 一 门 学 科 。 它 是 在 激光 技术 发 展 过 程 中 .由 于 光 通 信 、 光 学 信息 处 理 等 的 








需要 ， 而 逐步 形成 和 发 展 起 来 的 。 
2. integrated photonics (IP): 集成 光子 学 ， 主 要 研究 光子 集成 技术 ， 即 将 光电 子 器 
件 和 电子 器 件 集 成 在 芯片 上 。 
3. subpicosecond; 亚 皮 秒 。1 秒 二 1 000 毫秒 二 1 000 000 fife — 1 000 000 000 纳 秒 一 
1 000 000 000 000 皮 秒 ; 1 皮 秒 二 一 万 亿 分 之 一 秒 。 亚 皮 秒 就 是 比 皮 秒 还 要 小 的 时 间 尺 度 。 


dr 














Exercises 


I. Translate the following phrases into Chinese. 


l. optical fiber communication «e 


integrated optics 


2 S 
3. electro-optical element xv 


4. wafer-scale 


5. thermo-optic effect RY 


6. waveguide based optical amplifi 
7. integrated photonics = 
8. soliton ron SS XP 
E 
I. Fill in the versi on word(s). fr WE below. 


























optica I confining — transmission microelectronics 
=e congestion detected hazardous 
diameters expansion fabricated visible 
deposit transfer applications extension 
copper interface passive noninvasive 
l. The use of optical carrier waves or light for communication has been com- 


mon for many years. Simple systems such as signal fires, reflecting mirrors and, more re- 


cently, signaling lamps have provided successful. if limited. information 


2. The use of circular dielectric waveguide structures for light is universally 
utilized within optical fiber communications. Both IO and IP involve an of this 
guided wave optical technology through the use of planar waveguides to confine 


and guide the light in guided wave devices and circuits. 
3. Planar waveguide structures are produced using several different techniques which have in 


large part been derived from the industry. For example. devices may be fab- 











ricated by radio-frequency sputtering to thin films of glass onto glass substrates. 

4. Optical fibers have very small which are often no greater than the diame- 
ter of a human hair. Hence. even when such fibers are covered with protective coatings 
they are far smaller and much lighter than corresponding cables. This is a 
tremendous boon towards the alleviation of duct in cities. as well as allowing for 


an of signal within mobiles such as aircraft. satellites and even ships. 





5. Optical fibers which are from glass. or sometimes a plastic polymer. are 
electrical insulators and therefore. unlike their metallic counterparts. they do not exhibit 
earth loop and problems. Furthermore. this property makes optical fiber trans- 
mission ideally suited for communication in electrically environments as the fibers 


create no arcing or spark hazard at abrasions or short 







6. The light from optical fibers does not radiate sig, ly and therefore they pro- 


vide a high degree of signal security. Unlike the situatjónywith copper cables. a transmitted 
optical signal cannot be obtained from a fiber in manner (i. e. , without drawing 
optical power from the fiber). Therefore, Qs any attempt to acquire a message sig- 


nal transmitted optically may be AME is obviously attractive for military. 







banking and general data transmission s io dim network) 3 


ices into =< 


II. Translate the followin 


1. Communication be broadly defin Kia transfer of information from one 
point to another. en the information isst, bsenveyed over any distance a communica- 
tion system is ANS Nenuited: Within a nication system the information transfer is 
frequently EN i@ved by superimposing or modulating the information onto an electromag- 


netic wave which acts as a carrier for the information signal. This modulated carrier is then 
transmitted to the required destination where it is received and the original information 
signal is obtained by demodulation. Sophisticated techniques have been developed for this 
process using electromagnetic carrier waves operating at radio frequencies as well as microwave 
and millimeter wave frequencies. However. "communication" may also be achieved using an 


electromagnetic carrier which is selected from the optical range of frequencies. 
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2. Communication using an optical carrier wave guided along a glass fiber has a num- 











ber of extremely attractive features. several of which were apparent when the technique 
was originally conceived. Furthermore. the advances in the technology to date have sur- 
passed even the most optimistic predictions. creating additional advantages. Hence it is 
useful to consider the merits and special features offered by optical fiber communications o- 


ver more conventional electrical communications. 

















Reading: M E 
The direct detection of an GN modulated co cori is basically a photon 


counting process where ech GEN photon is co} ei o an electron-hole pair (or, in 
the case of the APD, fa of pairs due to: Ne e gain). 

hoe i tection receivers. however. are generally limited by noise gener- 
ated in the deteftorJahd preamplifier ex at very high signal-to-noise ratios (SNRs). 
The es such square-law detection systems is therefore reduced below the funda- 
mental quantum noise limit by at least 10 to 20 dB. This is particularly the case at longer 
wavelengths (i. e. . 1. 3 to 1. 64m) and at higher transmission rates since the electronic pre- 
amplifier usually has a rising input optical power with frequency requirement. For a good 
APD receiver operating in the wavelength range 1. 3 to 1. 6m this corresponds to between 
700 and 1 000 photons per bit required to maintain a bit-error-rate (BER) of 107°. 

Improvements in receiver sensitivity. together with wavelength selectivity, may be ob- 
tained using the well-known coherent detection techniques (i.e. . heterodyne and homo- 
dyne detection) for the optical signal. Unlike direct detection in which the optical signal is 
converted directly into a demodulated electrical output, such coherent optical receivers first 
add to the incoming optical signal from a locally generated optical wave prior to detecting 
the sum. The resulting photocurrent is a replica of the original signal which is translated 
down in frequency from the optical domain (around 105 GHz) to the radio domain (up to 


several GHz) and where conventional electronic techniques can be used for further signal 
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processing and demodulation. Hence an ideal coherent receiver operating in the 1.3 to 
1. 65m wavelength region requires a signal energy of only 10 to 20 photons per bit to a- 
chieve a BER of 107°. Hence coherent detection potentially provides a substantial benefit 
for high-speed systems operating at longer wavelengths. A possible improvement in receiv- 
er sensitivity using conventional coherent detection of up to 20 dB can be obtained over di- 
rect detection. Furthermore, such enhanced receiver sensitivity could translate into increa- 
ses in repeater spacings of the order of 100 km or more when using low-loss fiber at a 
wavelength of 1. 555m. Hence, the improved sensitivity of 5 to 20 dB which results from 
the photomixing gain in the coherent receiver could provide: 


(1) Increased repeater spacings for both inland and undersea fransmission systems. 







(2) Higher transmission rates over existing routes withopt jng repeater spacings. 
(3) Increased power budgets to compensate for do 


tical multiplexer/demultiplexer devices in distributio: 


jated with couplers and op- 


(4) Improved sensitivity to optical test equi Xe" optical time domain reflecto- 


meters, 






Although possible increases in the ^93 distance between repeaters created the 


initial impetus for the pursuit of ransmission within optical fiber communica- 
tions. by 1990, before the w em deployment of usi it was also per- 
ceived that coherent techni eda enable a fur; äðsive step to be taken in the ex- 
ploitation of the transm capacity of optics fj i This improvement would be 
facilitated by S wavelength selectivit orded by the coherent receiver allowing 
efficient access ast optical bandw available in single-mode fibers. For example. 
it was NEN or the low-loss window between 1. 3 and 1. 65m being over 50 000 GHz 
that coherent transmission would permit wavelength division multiplexing of huge channel 


numbers with channel spacings of only a few hundred megahertz. 





The successful introduction into the optical telecommunication network of erbium- 
doped fiber amplifiers in the early 1990s. however. caused the pursuit of coherent trans- 
mission to be virtually discontinued as both improved direct detection receiver sensitivity 
through amplification and hence increased transmission distance together with dense wave- 
length division multiplexing with channel spacings of 50 GHz (0. 4 nm) or even 12. 5 GHz 
(0. 1 nm) could be facilitated. 

The modulation formats that may be employed within coherent optical fiber communi- 
cations are essentially the same as those used in coherent electrical line and radio communi- 
cations. Modulation formats of this type were discussed in later parts in a slightly different 
context. namely the generation of subcarriers for electrical frequency division multiplexing 


prior to intensity modulation of the optical source. In these cases direct detection of the op- 








tical signal is carried out at the receiver with subsequent electrical demodulation for the 
subcarriers. Such systems only provide improvements in the SNR over baseband IM/DD 
systems at the expense of a substantial bandwidth penalty. When a narrow-linewidth injec- 
tion laser (less than 1 MHz) is used in an optical fiber communication system. however, it 
is possible to directly modulate the coherent optical carrier in amplitude (direct AM), fre- 
quency (direct FM) and phase (direct PM) prior to demodulation using a coherent optical 
receiver. In the case of digital transmission this implies amplitude. frequency or phase shift 
keying (i. e. , ASK, FSK or PSK) modulation techniques. 

Direct modulation for coherent optical fiber transmission is provided using an external 


modulator which is fed by a semiconductor laser source operating in continuous wave 
mode. In electrical digital communications. coherent demodu dni recovery of the 
carrier frequency signal through either heterodyne or mee Coherent optical 
communications, however, has had a wider usage Xe erminology. An optical fiber 
communication system was referred to as coher here was optical signal mixing e- 
ven though carrier recovery may not DE optical coherent system could employ 


a demodulator that did not perform carrie: ery but used noncoherent or envelope de- 


tection. In this context differential ph&se\shift keying (DPSK) is considered a noncoherent 
electrical modulation technique b as generally boop er! to as a coherent optical 
communication modulation fpjrüet. "Moreover. a c At tical receiver is referred to as 

we lepending upon whether it operates 


synchronous or asynchrogous (i. e. ，nonsynch 
with or without ee tracking, respecti ND ater asynchronous receiver normally 
一 
employs power erfvtlope detection. X 

Although «3 indicated above, eios in coherent optical fiber communications effec- 
tively ceased in the early 1990s as a result of the successful introduction of fiberamplifiers, 
DPSK using. in particular, asynchronous detection started to receive renewed interest fol- 
lowing specific experimental demonstrations in 2002. In the context of the renewed focus 
these high-performance. long-haul systems with return-to-zero DPSK transmission. where 
a phase-to-intensity conversion takes place at the receiver prior to a direct detection 
process, have subsequently been widely demonstrated. As optical mixing with an inde- 


pendent optical signal does not necessarily take place at the receiver to achieve carrier re- 





covery, such systems tend to be referred to as being optically phase-modulated or phase 
shift keyed. or when appropriate. as self-coherent. 
The term coherent is then often utilized for the case of coherent detection in which the 


phase reference at the receiver is typically provided by a local oscillator laser that beats 





with the received optical signal to produce constructive and destructive interference. There 


is an increasing adoption of this stricter approach to the use of the terminology and there- 
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fore the paper title reflects this trend by incorporating both the "coherent" and "phase- 
modulated" system terms in order to encompass systems using advanced modulation formats 
with direct detection. and also to emphasize the growth of phase modulation as the currently 
preferred advanced optical modulation format in competition with intensity modulation. 

While prior to the 1990s a major factor in the pursuit of coherent and phase-modulated 
optical fiber systems had been the perceived improvement in receiver sensitivities to enable 
transmission over longer distances. a main focus of the renewed interest since 2002 has 
been to increase spectral efficiency. Moreover. direct detection phase-modulated optical fi- 
ber systems do also provide for a sensitivity gain over IM/DD systems. albeit at more mod- 


est levels than when using synchronous detection. In particular, it,should be noted that the 


theoretical sensitivity gains which can be determined for both PS d DPSK over IM/DD 
are substantially reduced when optical amplifiers are depl ithin the latter systems. 
New Words and Phrases N 


coherent adj. 一致 的 ; 连贯 的 RO -种 掺 杂技 术 ) 

photon n， 光 子 ， 光 量子 NS subcarriers n， 子 载波 

preamplifier n. prp san penalty Ca 刑罚 ; 害处 ; 足球 点 球 
inewidth WATER; 行 宽 ; 线 幅 ; 线 宽 


quantum n. 量子; 定 line 
heterodyne n. 52i; ph. 卜 差 振荡 器 — dj XK n. TEA; 注射 剂 





homodyne n. Hees 司 步 检 波 Chronous adj. HHY 

demodulated 调 的 eom n. 光纤 放大 器 ， 光 放大 器 

visse N 电流 renewed adj. 更 新 的 ; 重建 的 

replica n， 复 制品 vt. & vi. (使 ) 复 原 , (使) 更 新 (ve- 

reflectometer n， 反 射 计 new 的 过 去 分 词 ) 

impetus n， 动 力 ; 促进 ; 势头 ; 声势 long-haul adj. 长 途 的 

exploitation n， 开 发 ; 利用; 剥削 ; 广告 encompass u. (HSE. 包围 ; 包含 或 包括 某 
推销 事物 ; 完成 




















erbium-doped n. HZ CEFET ZPA deploy vt. & vi. 使 展开 ; 施展 ; 有 效 地 利用 





Exercises 


I. Answer the following questions. 


1. What's the direct detection of an intensity-modulated optical carrier? 
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2. What could the improved sensitivity of 5 to 20 dB provide? 
3. What are the common phase shift keying methods? 


I. Translate the following sentences into Chinese. 


l. Improvements in receiver sensitivity. together with wavelength selectivity, may be 
obtained using the well-known coherent detection techniques (i. e. . heterodyne and homo- 
dyne detection) for the optical signal. Unlike direct detection in which the optical signal is 
converted directly into a demodulated electrical output. such coherent optical receivers first 
add to the incoming optical signal from a locally generated optical wave prior to detecting 


the sum. 

















2. Direct modulation for coherent OS transmission is provided using an ex- 


ternal modulator which is fed by a sei üctor laser source operating in continuous wave 
mode. In electrical digital FUA ns. coherent dei lation requires recovery of the 
carrier frequency M DA ither ME. ne detection. 








E =< 











3. The direct detection of an intensity-modulated optical carrier is basically a photon 
counting process where each detected photon is converted into an electron-hole pair (or, in 


the case of the APD, a number of pairs due to avalanche gain). 























UNIT 13 OPTICAL FIBER COMMUNICATION 


Abstract Reading 





Optical Memory Made of Photonic Crystal Working over the C-band of ITU 


After several decades pushing the technology and the development of the world the e- 
lectronics is giving space for technologies that use light. We propose and analyze an optical 


memory embedded in a nonlinear Photonic Crystal (PhC), whose system of writing and 






reading of data is controlled by external optical command si; is optical memory is 


based on two optical Nonlinear Directional Couplers (N which are connected to a 
shared ring. These NDCs have a small coupling ord e optical memory can works 
-Band of ITU (International Tel- 


to 1 565 nm), 





in an all-optical broadband telecommunications s 
ecommunication Union)—wavelength ANA 


Characterization and Training of a ment Piezoelectric Deformable Mirror for 


Lensing XM 
Characterizing E response XA ional parameters of a deformable 


mirror is a critical step in the design ofan o! aptive optics system. This paper describes 
the characterization AS training of a pié eS deformable mirror (PDM) to be imple- 
Adaptive Optics (AQ) system at Short Wave Infrared (SWIR) 


wavelengths for free-space optical communications systems. The data were analyzed using 


mented on a | 


commercial and customized software. 


Algorithmical Analysis of Information-theoretic Aspects of Secure Communication 
over Optical-fiber Quantum Channels 


The information-theoretic security of optical-fiber based quantum communication is the 
fundamental question of quantum cryptography. Quantum cryptographic schemes use pho- 
tons as information carriers. The physical properties of photons make it possible to use 
quantum bits to realize unconditionally secure quantum communication over the current 
standard optical fiber network. Quantum cryptography is one of the most important and 
advanced fields in the area of quantum information processing. This paper analyzes the in- 


formation-theoretic security of the most important and prevalent optical-fiber based QKD 








schemes, such as BB84. Six-state and DPS QKD schemes. using efficient information geo- 
metric approaches. We study the information-theoretic impacts of the most general eaves- 
dropping attacks against these protocols using efficient algorithms. Currently. the ability 
to perform these attacks is well beyond today's technological capabilities; however. in the 
future. these types of attacks can be used to eavesdrop on quantum communications over 
optical fibers. The information-theoretic security of these protocols is analyzed by informa- 
tion geometric algorithms and abstract geometrical objects. To describe the security of the 
protocols. we introduce the quantum informational ball representation. and we discover the 
connection between the length of the optical fiber and the radius of the quantum informa- 
tional ball. For practical reasons, we will also demonstrate our algorithm for the DPS QKD 


protocol, 





unit 14 


COMPUTER NETWORKS 


Text: Business Applications of x er Networks 


Most companies have a substantial EX be S For example. a company. 


may have a computer for each worker a A n to design products, write brochures. 






and do the payroll. Initially. some o i ce din may have worked in isolation from 


the others, but at some point. mą nt may have de 1 to connect them to be able to 
distribute information thro € company. 
Put in slightly mo en form. the issue Rage NL us sharing. The goal is to 


make all programs% equipment, and especiall n = it to anyone on the network 





urce or the user. An obvious and wide- 





without CN x t Kee location 
spread exam Ma a group of office Workers share a common printer. None of the in- 
dividuals rea PA needs a private printer. and a high-volume networked printer is often chea- 
per. faster. and easier to maintain than a large collection of individual printers. 

However. probably even more important than sharing physical resources such as 
printers, and tape backup systems, is sharing information. Companies small and large are 
vitally dependent on computerized information. Most companies have customer records. 
product information. inventories. financial statements, tax information, and much more 
online. If all of its computers suddenly went down. a bank could not last more than five 
minutes. A modern manufacturing plant, with a computer-controlled assembly line, would 
not last even 5 seconds. Even a small travel agency or three-person law firm is now highly 
dependent on computer networks for allowing employees to access relevant information and 
documents instantly. 

For smaller companies. all the computers are likely to be in a single office or perhaps a 


single building. but for larger ones. the computers and employees may be scattered over 











dozens of offices and plants in many countries, Nevertheless. a sales person in New York 
might sometimes need access to a product inventory database in Singapore. Networks 
called VPNs (Virtual Private Networks) may be used to join the individual networks at dif- 
ferent sites into one extended network. In other words. the mere fact that a user happens 
to be 15 000 km away from his data should not prevent him from using the data as though 
they were local. This goal may be summarized by saying that it is an attempt to end the 
"tyranny of geography". 

In the simplest of terms. one can imagine a company's information system as consis- 
ting of one or more databases with company information and some number of employees 
who need to access them remotely. In this model. the data are stored on powerful comput- 


ers called servers. Often these are centrally housed and main a system administra- 








tor. In contrast. the employees have simpler machines. ci ients, on their desks, with 


which they access remote data, for example, to incl 'eadsheets they are construc- 
ting. (Sometimes we will refer to the human us lient machine as the "client", but 
it should be clear from the context ae’ the computer or its user. ) The client 


and server machines are connected by a BON ie that we have shown the network as 






a simple oval. without any detail. tise ioa form when we mean a network in the 


most abstract sense. eb il is required. kad provided. This whole ar- 
rangement is called the dap 7 Wd model. It is Me and forms the basis of much 
network usage. The mo* A realization is thaKof a Web application, in which the 
server generates pages based on its Ne response to client requests that may 
he client-server s applicable when the client and server are 


update the NS 
both in the RN uilding (and belong to the same company) . but also when they are far a- 


part. For example. when a person at home accesses a page on the World Wide Web. the 





same model is employed. with the remote Web server being the server and the user's per- 
sonal computer being the client. Under most conditions. one server can handle a large 
number (hundreds or thousands) of clients simultaneously. 

If we look at the client-server model in detail. we see that two processes (i. e. . run- 
ning programs) are involved. one on the client machine and one on the server machine. 
Communication takes the form of the client process sending a message over the network to 
the server process. The client process then waits for a reply message. When the server 
process gets the request. it performs the requested work or looks up the requested data and 
sends back a reply. A second goal of setting up a computer network has to do with people 
rather than information or even computers. A computer network can provide a powerful 
communication medium among employees. Virtually every company that has two or more 


computers now has email (electronic mail), which employees generally use for a great deal 











of daily communication. In fact. a common gripe around the water cooler is how much 
email everyone has to deal with. much of it quite meaningless because bosses have discov- 
ered that they can send the same (often content-free) message to all their subordinates at 
the push of a button. Telephone calls between employees may be carried by the computer 
network instead of by the phone company. This technology is called IP telephony or Voice 
over IP (VoIP) ™ when Internet technology is used. The microphone and speaker at each 
end may belong to a VoIP-enabled phone or the employee's computer. 

Companies find this a wonderful way to save on their telephone bills. Other. richer 
forms of communication are made possible by computer networks. Video can be added to 


audio so that employees at distant locations can see and hear each other as they hold a 


meeting. This technique is a powerful tool for eliminating th d time previously de- 
voted to travel. Desktop sharing lets remote workers see ract with a graphical com- 
puter screen, This makes it easy for two or more p who work far apart to read and 
write a shared blackboard or write a report tog en one worker makes a change to 


an online document. the others can see ASA mediately, instead of waiting several 
days for a letter. Such a speedup makes/too ion among far-flung groups of people easy 


where it previously had been im ore ambitious forms of remote coordination 






such as telemedicine are only no 
~ 


Stating to be used fidam patient monitoring) 
but may become much more rtant, It is M nis hat communication and trans- 





portation are having a rde. and whichever wins wi ake the other obsolete. 
A third goal fo^ any companies is doi s electronically. especially with cus- 
一 


tomers and syppliery.* This new model is e-commerce (electronic commerce) and it 
has grown AU. recent years. Airlines. bookstores. and other retailers have discov- 
ered that many customers like the convenience of shopping from home. Consequently. 
many companies provide catalogs of their goods and services online and take orders online. 
Manufacturers of automobiles. aircraft, and computers. among others. buy subsystems 
from a variety of suppliers and then assemble the parts, Using computer networks. manu- 
facturers can place orders electronically as needed. This reduces the need for large invento- 


ries and enhances efficiency. 


New Words and Phrases 





brochure n， 小 册子 ， ovaladj. 椭圆 形 的 

payroll n， 工 资 单 n. 椭圆 形 

tyranny n. 暴虐 ; 专横 subordinate adj. 级 别 或 职位 较 低 的 ; 下 级 
spreadsheet n， 电 子 表格 的 ; 次 要 的 ; 附属 的 











LAB MENTEERAUR 





n. 部 属 ; 部 下 ， 下 级 obsolete adj. 废弃 的 ; 老式 的 
vt, fi 居 下 位 ， 使 在 次 级 ; 7.。 废 词 ， 陈腐 的 人 
使 服从 ; 使 从 属 vt. HW: 废弃 


far-flung adj. 遥远 的 ,广泛 的 
telemedicine n. (通过 遥测 、 电 话 、 电 视 等 
手段 求 诊 的 ) 远 距离 医学 





Notes 


1. VoIP; Voice over Internet Protocol， 简 而 言 之 就 是 将 模拟 信号 (Voice) 数 字 化 ， 以 
数据 封包 (Data Packet) 的 形式 在 IP 网 络 (IP Network) F Sch er 最 大 的 优势 是 能 
广泛 地 采用 Internet 和 全 球 IP 互 连 的 环境 ,提供 ed 更 好 的 服务 。VoIP 可 
以 在 IP 网 络 上 便宜 地 传送 语音 、 传 真 、 视 频 和 数据 等 0 统一 消息 业务 、 虚 拟 电 话 、 
虚拟 语音 /传真 邮箱 、 查 号 业务 、Internet 呼叫 中 站 ernet 呼叫 管理 、 电 话 视频 会 议 、 
电子 商务 、 I b 


Exercises SS < 


I. Translate the aA into Chine: A. 
l. thigh-voludt fet orked printer V 
x 


2. tape adeupl K 
3; mex line 
4. Virtual Private Networks 

. client-server 


5 
6. e-commerce 
7. 


. telemedicine 


I. Fill in the blanks with missing word(s) from the table below. 























subnet behalf determines telecommunications 
datagrams optical representing components 
centralized destination emphasize implemented 
independent layer allocation hierarchy. 
protocols allocation transmitted transmission 
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1. Dynamic methods for a common channel are either or decentral- 
ized. In the centralized channel method. there is a single entity. for example. the 
base station in cellular networks. which who goes next. 

2. In most WANs, the con: 


and switching elements. lines move bits between machines. They can be made of 





s of two distinct : transmission lines 


copper wire, fiber. or even radio links. Most companies do not have transmission 
lines lying about. so instead they lease the lines from a company. 

3. Services and are distinct concepts. This distinction is so important that 
we it again here. A service is a set of primitives Coperations) that a layer provides 


form on of its users, but it says nothing at all al hese operations are 


to the layer above it, The service defines what operations the is prepared to per- 






$ e physical property such as 


r current as a single-valued func- 


4. Information can be on wires by va 
voltage or current. By the value of this 


tion of time, f(t), we can model the MEM signal and analyze it mathematically. 


5. Together with the network lay the. or layer is the heart of the protocol 
. The network layer provides o-end packet delivery using — — or virtual 
circuits. The transport WS ui the network du rovide data transport from a 





process on a source cit process on a Xn with a desired level of reli- 
ability that is h 2 ii networks eurpently in use. 


Translat die lowing cone inese. 


. PANS w Area Networks) f^ devices communicate over the range of a per- 
son. A common example is a wireless network that connects a computer with its peripher- 
als. Almost every computer has an attached monitor. keyboard. mouse. and printer. 
Without using wireless. this connection must be done with cables. So many new users have 
a hard time finding the right cables and plugging them into the right little holes (even 
though they are usually color coded) that most computer vendors offer the option of send- 


ing a technician to the user's home to do it. 

















2. Itis worth spending a little more time discussing LANs in the home. In the future. 


it is likely that every appliance in the home will be capable of communicating with every 








other appliance. and all of them will be accessible over the Internet. This development is 
likely to be one of those visionary concepts that nobody asked for (like TV remote controls 
or mobile phones). but once they arrived nobody can imagine how they lived without 


them. 

















3. To reduce their design complexity. most networks are organized as a stack of lay- 


ers or levels, each one built upon the one below it. The numl ers. the name of each 
layer. the contents of each layer. and the function of each iffer from network to net- 
work. The purpose of each layer is to offer certain si s tò the higher layers while shiel- 






ding those layers from the details of how the of SeiWices are actually implemented. In 


a sense, each layer is a kind of virtual ess 
it. 


ing certain services to the layer above 

















Reading: Home Applications 





In 1977. Ken Olsen was president of the Digital Equipment Corporation. then the 
number two computer vendor in the world (after IBM). When asked why Digital was not 
going after the personal computer market in a big way. he said; "There is no reason for 
any individual to have a computer in his home. " History showed otherwise and Digital no 
longer exists. People initially bought computers or word processing and games. Recently. 
the biggest reason to buy a home computer was probably for Internet access. Now, many 
consumer electronic devices, such as set-top boxes, game consoles, and clock radios, come 
with embedded computers and computer networks, especially wireless networks. and home 
networks are broadly used for entertainment. including listening to. looking at. and crea- 
ting music. photos. and videos. 


Internet access provides home users with connectivity to remote computers. As with 
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companies. home users can access information. communicate with other people. and buy 
products and services with ecommerce. The main benefit now comes from connecting out- 
side of the home. Bob Metcalfe. the inventor of Ethernet. hypothesized that the value of a 
network is proportional to the square of the number of users because this is roughly the 
number of different connections that may be made (Gilder, 1993). This hypothesis is 
known as “Metcalfe’s law". It helps to explain how the tremendous popularity of the Inter- 
net comes from its size. 

Access to remote information comes in many forms. It can be surfing the World Wide 
Web for information or just for fun. Information available includes the arts. business. 
cooking. government, health, history. hobbies, recreation, sciepce. sports, travel, and 


many others. Fun comes in too many ways to mention, plus s ys that are better left 







unmentioned. S 
Many newspapers have gone online and can be 2 


ng about corrupt politicians. big 


d. For example, it is some- 






times possible to tell a newspaper that you wan 
fires, scandals involving celebrities, and epid ut no football. thank you. Sometimes 
it is possible to have the selected articles áded to your computer while you sleep. As 


this trend continues. it will cause unemployment among 12-year-old paperboys. 


ion has always A weakest link in the whole 
production chain. Of coursey ake this MET. will first have to figure out 
how to make money og world. somet entirely obvious 
expect everything ef E 
p bPyond newspapers cesi planes and scientific journals) is the online 


The next’ 
digital library. Many professional organizatio 


the IEEE Computer Society (www. computer. org). already have all their journals and 






but newspapers like it mn 


since Internet users 








s. such as the ACM (www. acm. org) and 





conference proceedings online. Electronic book readers and online libraries may make prin- 
ted books obsolete. Skeptics should take note of the effect the printing press had on the 
medieval illuminated manuscript. Much of this information is accessed using the client- 
server model, but there is different, popular model for accessing information that goes by 
the name of peer-to-peer communication). In this form, individuals who form a loose group 
can communicate with others in the group. Every person can, in principle, communicate 
with one or more other people; there is no fixed division into clients and servers. Many 
peer-to-peer systems, such BitTorrent (Cohen, 2003), do not have any central database of 
content. Instead, each user maintains his own database locally and provides a list of other 
nearby people who are members of the system. A new user can then go to any existing 
member to see what he has and get the names of other members to inspect for more content 


and more names. This lookup process can be repeated indefinitely to build up a large local 
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database of what is out there. 


It is an activity that would get tedious for people but computers excel at it. Peer-to- 
peer communication is often used to share music and videos. It really hit the big time a- 
round 2000 with a music sharing service called Napster that was shut down after what was 
probably the biggest copyright infringement case in all of recorded history (Lam and Tan. 
2001; and Macedonia. 2000). Legal applications for peer-to-peer communication also ex- 
st. These include fans sharing public domain music. families sharing photos and movies. 
and users downloading public software packages. In fact. one of the most popular Internet 
applications of all. email. is inherently peer-to-peer. This form of communication is likely 
to grow considerably in the future. 

All of the above applications involve interactions betwee; deo and a remote data- 
base full of information. The second broad category of k use is person-to-person 
communication. basically the 21* century's answer an "century's telephone. E-mail 
is already used on a daily basis by millions of p Xs the world and its use is grow- 
ing rapidly. It already routinely contains EV. eo as well as text and pictures, Smell 
may take a while. Any teenager worth hi SM is addicted to instant messaging, This 
facility. derived from the UNIX tal 






Tn in use since around 1970. allows two people 


to type messages at each othe 


in ne. There are den messaging services too. 
such as the Twitter service ets people NES! messages called "tweets" to 


their circle of friends iin willing audiences 
The Internet be Used by lai Des audio (e. g. . Internet radio stations) 


and video (e. Tube). Besides bei Cheap way to call to distant friends, these ap- 
plications NCH rich experiences such as telelearning, meaning attending 8 A. M. 
classes without the inconvenience of having to get out of bed first. In the long run, the use 
of networks to enhance human-to-human communication may prove more important than 
any of the others. It may become hugely important to people who are geographically chal- 
lenged. giving them the same access to services as people living in the middle of a big city. 

Between person-to-person communications and accessing information are social net- 
work applications. Here. the flow of information is driven by the relationships that people 
declare between each other. One of the most popular social networking sites is Facebook. 
It lets people update their personal profiles and shares the updates with other people who 
they have declared to be their friends. 

Other social networking applications can make introductions via friends of friends. 
send news messages to friends such as Twitter above. and much more. Even more loosely. 
groups of people can work together to create content. A wiki. for example. is a collabora- 


tive Web site that the members of a community edit. The most famous wiki is the Wikipe- 








dia. an encyclopedia anyone can edit. but there are thousands of other wikis. 

Our third category is electronic commerce in the broadest sense of the term. Home 
shopping is already popular and enables users to inspect the online catalogs of thousands of 
companies, Some of these catalogs are interactive. showing products from different view- 
points and in configurations that can be personalized. After the customer buys a product e- 
lectronically but cannot figure out how to use it. online technical support may be consul- 
ted. 

Another area in which e-commerce is widely used is access to financial institutions. 
Many people already pay their bills. manage their bank accounts. and handle their invest- 


ments electronically. This trend will surely continue as networks become more secure. 







One area that virtually nobody foresaw is electronic flea Ce-flea). Online auc- 


tions of second-hand goods have become a massive ind Unlike traditional e-com- 
merce, which follows the client-server model. onlin ionis are peer-to-peer in the sense 
that consumers can act as both buyers and sellers, f these forms of e-commerce have 


acquired cute little tags based on the fact th d “2” are pronounced the same. 






Our fourth category is entertainme as made huge strides in the home in re- 






cent years. with the distribution of iQ.) Tadio and television programs. and movies over 


the Internet beginning to riva] t traditional ects Users can find, buy, and 
download MP3 songs and uality movies no to their personal collection. 
1 








TV shows now reach mes via IPTV ( sion) systems that are based on IP 


Media streaming applications let us- 





technology instea ablé TV or radio pi s 
- 


ers tune into ndi?radio stations or i Tecent episodes of their favorite TV shows. 





Naturally. all s content can be moved around your house between different devices. 
displays and speakers. usually with a wireless network. 

Soon, it may be possible to search for any movie or television program ever made, in 
any country. and have it displayed on your screen instantly. New films may become inter- 
active. where the user is occasionally prompted for the story direction ( should Macbeth 
murder Duncan or just bide his time?) with alternative scenarios provided for all cases. 
Live television may also become interactive. with the audience participating in quiz shows. 
choosing among contestants. and so on. 

Another form of entertainment is game playing. Already we have multiperson real- 
time simulation games. like hide-and-seek in a virtual dungeon. and flight simulators with 
the players on one team trying to shoot down the players on the opposing team. Virtual 
worlds provide a persistent setting in which thousands of users can experience a shared re- 
ality with three-dimensional graphics. 


Our last category is ubiquitous computing. in which computing is embedded into eve- 
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ryday life. as in the vision of Mark Weiser (1991). Many homes are already wired with se- 
curity systems that include door and window sensors. and there are many more sensors 
that can be folded in to a smart home monitor. such as energy consumption. Your electrici- 
ty. gas and water meters could also report usage over the network. This would save money 
as there would be no need to send out meter readers. And your smoke detectors could call 
the fire department instead of making a big noise Cwhich has little value if no one is home). 
As the cost of sensing and communication drops. more and more measurement and repor- 
ting will be done with networks. 

Increasingly. consumer electronic devices are networked. For example. some high-end 
cameras already have a wireless network capability and use it to send photos to a nearby 
display for viewing. Professional sports photographers can als their photos to their 
editors in real-time, first wirelessly to an access point de thé Internet. Devices such 

e 


as televisions that plug into the wall can use pow tworks to send information 


throughout the house over the wires that carry XQ may not be very surprising to 
have these objects on the network. but XR e do not think of as computers may 
sense and communicate information too, o AS your shower may record water us- 
age, give you visual feedback while yo Er up, and report to a home environmental mo- 
nitoring application when you ar to help save on ater bill. 

A technology called Ri adio Frequency ion) will push this idea even 


further in the future. ees are passive have no battery) chips the size of 
stamps and they c. be affixed to T pets, cred it cards, and other 
items in the un out. This lets Ri ae locate and communicate with the items 
over a distance)afup to several meters. depending on the kind of RFID. Originally. RFID 
was commercialized to replace barcodes. It has not succeeded yet because barcodes are free 
and RFID tags cost a few cents. Of course. RFID tags offer much more and their price is 


rapidly declining. They may turn the real world into the Internet of things (ITU, 2005). 


New Words and Phrases 





set-top n. 机 顶 vt. 淘汰 ; 废弃 

console n. 控制 台 ; 操纵 台 skeptic n. 怀疑 论 者 ; 怀疑 者 
hypothesize vt. 假设 .假定 medieval adj. 中 世纪 的 ; 原始 的 
Metcalfe's law 梅 特 卡 夫 定 律 peer-to-peer n. 点 对 点 

scandal». 丑闻 ; fir aia: 诽谤 ; 公愤 illuminate or. 阐明 ， 说 明 
epidemic adj. 流行 的 ; 传染 性 的 manuscript n. 手稿 ; 原稿 











obsolete adj. 废弃 的 ; 老式 的 inherently adv. 内 在 地 ; 固有 地 ; 天 性 地 
































derive vt. Wi T: A stride n. AE; 步 幅 ; 进展 
collaborative adj. 合作 的 ， 协 作 的 vi. Pit; 大 踏步 走 过 
configuration n. 配置; 结构 ;外形 contestant n. 竞争 者 ; 争辩 者 
auction vt. 拍卖 ; 竞 卖 simulation n. 仿真 ; 模拟 

n. È ubiquitous adj. 普遍 存在 的 ; 无 所 不 在 的 


affix ut. FRE; 署名 


Exercises 


I. Answer the following questions. 


l. List some consumer electronic devices. x 
2. When was e-mail first put in use? s$ 
~ 


3，What canTwitter service do? 





I. Translate the following sentei 







l. Internet access provides users with con 
iN . " 
with companies, home users’ access informati 


buy products and servic ith e-commerce. T! 


outside of AUR x 


to remote computers. As 
nicate with other people. and 


enefit now comes from connecting 

















2. Between person-to-person communications and accessing information are social net- 
work applications. Here. the flow of information is driven by the relationships that people 
declare between each other. One of the most popular social networking sites is Facebook. 
It lets people update their personal profiles and shares the updates with other people who 


they have declared to be their friends. 
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3. Increasingly. consumer electronic devices are networked. For example. some high- 
end cameras already have a wireless network capability and use it to send photos to a near- 
by display for viewing. Professional sports photographers can also send their photos to 
their editors in real-time. first wirelessly to an access point then over the Internet. Devices 
such as televisions that plug into the wall can use power-line networks to send information 


throughout the house over the wires that carry electricity. 

















Abstract Re Qs 
ki 


Algorithms for Enhanced S nce Coordination (eICIC) in LTE HetNets 


The success of LTE Interns networks &) with macrocells and picocells 
critically depends on Mas spectrum sharin, n high-power macros and low-power 


picos. Two ma lenges in this ci X : (D determining the amount of radio 
resources tha acrócells should offer & cells, and (2) determining the association 
rules that deci hich user equipments (UEs) should associate with picos. In this paper. 
we develop a novel algorithm to solve these two coupled problems in a joint manner. Our 
algorithm has provable guarantee. and furthermore, it accounts for network topology. 
traffic load, and macro-pico interference map. Our solution is standard compliant and can 
be implemented using the notion of Almost Blank Subframes (ABS) and Cell Selection Bias 
(CSB) proposed by LTE standards. We also show extensive evaluations using RF plan 
Írom a real network and discuss self-optimized networking ( SON)-based enhanced inter- 


cell interference coordination (eICIC) implementation. 
Fine Comb: Measuring Microscopic Latency and Loss in the Presence of Reordering 
Modern stock trading and cluster applications require microsecond latencies and almost 


no losses in data centers. This paper introduces an algorithm called FineComb that can ob- 


tain fine-grain end-to-end loss and latency measurements between edge routers in these net- 
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works. Such a mechanism can allow managers to distinguish between latencies and loss sin- 
gularities caused by servers and those caused by the network. Compared to prior work. 
such as Lossy Difference Aggregator (LDA). which focused on switch-level latency meas- 
urements. the requirement of end-to-end latency measurements introduces the challenge of 
reordering that occurs commonly in IP networks due to churn. The problem is even more 
acute in switches across data center networks that employ multipath routing algorithms to 
exploit the inherent path diversity. Without proper care. a loss estimation algorithm can 
confound loss and reordering; furthermore. any attempt to aggregate delay estimates in the 
presence of reordering results in severe errors, FineComb deals with these problems using 


han LDA in loss and 


order-agnostic packet digests and a simple new idea we call stash recovery. Our evaluation 
demonstrates that FineComb is orders of magnitude more a A 


delay estimates in the presence of reordering. « 
Infrastructure-free Content-based rae 


Peer-to-peer (P2P) networks can is to distributed content-based publish/ 


subscribe data dissemination systems. articular, since a P2P network's aggregate re- 


4 





sources grow as the number o; nts increase: oU can be achieved using no 
. S - B . H 
infrastructure other than th icipants' own NE his paper proposes algorithms 


sed publish/subscrib 








Íor supporting content- ich subscriptions can specify a range 


of interest and pu tions a range of values, algorithms are built over a distributed 
一 

hash table a NS and are completel tralized. Load balance is addressed by sub- 

scription delégatón away from overloaded peers and a bottom-up tree search technique that 


avoids root hotspots. Furthermore. fault tolerance is achieved with a lightweight replica- 
tion scheme that quickly detects and recovers from faults. Experimental results support the 
scalability and fault-tolerance properties of the algorithms: for example. doubling the num- 
ber of subscriptions does not double internal system messages. and even the simultaneous 


failure of 20% of the peers in the system requires less than 2 min to fully recover. 





uur 15 


DIGITAL COMMUNICATION 


Text: Introduction to Digital C unication 


NE 


Communication has been one of the deepest the human race throughout. recorded 






history. It is essential to forming social uni tating the young. and to expressing a myri- 


ad of emotions and needs, Good commi is central to a civilized society. 









The various communication lines in engineeri have the purpose of providing 
technological aids to hum. ce ication, One d the smoke signals and drum 
rolls of primitive socie ng technological i á rene. but communica- 





it today beca v ant with telegraphy, then telephony, 


tion technology po M vi 
then video. ice communicati a oday the amazing mixture of all of these in 
shd 


inexpensive, ^S portable devices. 





Initially these technologies were developed as separate networks and were viewed as having 
little in common. As these networks grew, however, the fact that all parts of a given network 
had to work together, coupled with the fact that different components were developed at different 
times using different design methodologies, caused an increased focus on the underlying principles 
and architectural understanding required for continued system evolution. 

This need for basic principles was probably best understood at American Telephone 
and Telegraph (AT&T) where Bell Laboratories was created as the research and develop- 
ment arm of AT&.T. The Math Center at Bell Labs became the predominant center for 
communication research in the world. and held that position until quite recently. 

The central core of the principles of communication technology was developed at that 
center, Perhaps the greatest contribution from the Math Center was the creation of Infor- 
mation Theory by Claude Shannon. For perhaps the first 25 years of its existence, Infor- 


mation Theory was regarded as a beautiful theory but not as a central guide to the architec- 
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ture and design of communication systems. After that time, however. both the device 
technology and the engineering understanding of the theory were sufficient to enable sys- 
tem development to follow information theoretic principles. 

A number of information theoretic ideas and how they affect communication system 
design will be explained carefully in later parts. One pair of ideas. however. is central to 
almost every topic. The first is to view all communication sources. e.g. . speech wave- 
forms, image waveforms, and text files, as being representable by binary sequences. The 
second is to design communication systems that first convert the source output into a bina- 
ry sequence and then convert that binary sequence into a form suitable for transmission o- 
ver particular physical media such as cable. twisted wire pair. optieal fiber. or electromag- 
netic radiation through space. 

Digital communication systems. by definition. are nication systems that use 
such a digital sequence as an interface between the s ys the channel input Cand simi- 
larly between the channel output and final desti 

The idea of converting an analog MES oa AU sequence was quite revolu- 


tionary in 1948, and the notion that this/& od Ie oe before channel processing was even 






more revolutionary. Today. with di Meras, digital video. digital voice, etc. . the i- 


dea of digitizing any kind oi a US dh e ipe the most technophobic. 
The notion of a binary i meti petore channel SE almost as commonplace. For 
example. we all refer t S of our Internet coKnection in bits per second. 

There are a Re r gf reasons why ie systems now usually contain a bi- 
nary interfac tw source and chani. + why digital communication systems are 
now aalr ÂN ) ese will be explained with the necessary qualifications later, but briefly 
they are as follows. 

(1) Digital hardware has become so cheap, reliable, and miniaturized that digital in- 
terfaces are eminently practical. 

(2) A standardized binary interface between source and channel simplifies implementa- 
tion and understanding, since source coding/decoding can be done independently of the 
channel, and, similarly, channel coding/decoding can be done independently of the source. 

(3) A standardized binary interface between source and channel simplifies networking, 
which now reduces to sending binary sequences through the network. 

(4) One of the most important of Shannon’s information theoretic results is that if a source 
can be transmitted over a channel in any way at all, it can be transmitted using a binary interface 
between source and channel. This is known as the source/channel separation theorem. 

In the remainder of this paper, the problems of source coding and decoding and chan- 


nel coding and decoding are briefly introduced. First, however, the notion of layering in a 
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communication system is introduced. Standardized interfaces and layering large communi- 
cation systems such as the Public Switched Telephone Network (PSTN) and the Internet 
have incredible complexity. made up of an enormous variety of equipment made by differ- 
ent manufacturers at different times following different design principles. Such complex 
networks need to be based on some simple architectural principles in order to be under- 
stood, managed, and maintained. Two such fundamental architectural principles are stand- 
ardized interfaces and layering. 

A standardized interface allows the user or equipment on one side of the interface to ig- 
nore all details about the other side of the interface except for certain specified interface 


characteristics. For example. the binary interface in Figure 15. 1 allows the source coding/ 





communication systems is to break up communication fu s into a string of separate 


decoding to be done independently of the channel on ta he idea of layering in 


layers. Each layer consists of an input module at thexfiput of a communication system 


and a "peer" output module at the other end. T) nodule at layer 7 processes the in- 
formation received from layer i+ 1 and EA» processed information on to layer 
s 


i—]. The peer output module at layer iAyor the opposite direction. processing the re- 


ceived information from layer i— 1 an ding it on to layer i. As an example, an input 
module might receive a voice wa rom the next on and convert the wave- 
lower layer. The output peer 


å . mA . 
form into a binary data sequi hàt is passed on Msi 
e 


module would receive a binary sequence from the ower layer at the output and convert 


it back to a peso rm. x 
Xo ! source source ^J 







encoder 








binary 
interface 


destination 


Figure 15. 1 Placing a binary interface between source and channel, The source encoder converts the 
source output to a binary sequence and the channel encoder (often called a modulator) processes the 
binary sequence for transmission over the channel. The channel decoder (demodulator) recreates the 
incoming binary sequence (hopefully reliably) , and the source decoder recreates the source output. 


As another example. a modem consists of an input module (a modulator) and an out- 


put module (a demodulator). The modulator receives a binary sequence from the next 
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higher input layer and generates a corresponding modulated waveform for transmission over 
a channel. The peer module is the remote demodulator at the other end of the channel. It 
receives a more or less faithful replica of the transmitted waveform and reconstructs a typi- 
cally faithful replica of the binary sequence. Similarly. the local demodulator is the peer to 
a remote modulator Coften collocated with the remote demodulator above). Thus a modem 
is an input module for communication in one direction and an output module for independ- 
ent communication in the opposite direction. For now. it is enough simply to view the 
modulator as converting a binary sequence to a waveform. with the peer demodulator con- 
verting the waveform back to the binary sequence. As another example. the source coding/ 


decoding layer for a waveform source can be split into three layers. One of the advantages 







of this layering is that discrete sources are important topics ji own right and corre- 


spond to the inner layer. Quantization is also an inno ic in its own right. After 
both of these are understood, waveform sources DA simple to understand. The 
channel coding/decoding layer can also be split i. eVaral layers, but there are a number 
of ways to do this which will be discussed En 

Ir 


ing/decoding can be used as an outer la 






example. binary error correction cod- 
nodulation and demodulation as an inner 
layer. but it will be seen later that,thess hire a number of advantages in combining these 


layers into what is called code ulation, Even rode layering is important, 
s 


but the layers are defined di tly for different Se 
It should be emphasi£ed that layering is m € than simply breaking a system into 


components, The is d peer output inyéac! er encapsulate all the lower layers. and 
all these low NM an be viewed in ál te as a communication channel. Similarly. 
the higher layejs an be viewed in aggregate as a simple source and destination. 

The above discussion of layering implicitly assumed a point-to-point communication 
system with one source. one channel. and one destination, Network situations can be con- 
siderably more complex. With broadcasting. an input module at one layer may have multi- 
ple peer output modules. Similarly. in multiaccess communication a multiplicity of input 
modules have a single peer output module. It is also possible in network situations for a 
single module at one level to interface with multiple modules at the next lower layer or the 
next higher layer. The use of layering is at least as important for networks as it is for 


point-to-point communications systems. 
New Words and Phrases 


myriad adj. 无数 的 ; 种 种 的 telegraphy ». 电信 ; 电报 学 
n 无 数 ， 极 大 数量 telephony n. 电话 (学 ); 电话 制造 











eminently adv. 突出 地 ; 显著 地 
卓越 的 ; 支配 demodulator n. 解 调 器 
encapsulate vt. JR 4%; dE PNE 
将 …… 封 进 内 部 ;概述 
aggregate vi. 集合 ; 聚集 ; 合计 
vt. RAs 聚集 ; 合计 


methodology n. 方法 学 ， 方法论 


predominant adj. 主要 的 ; 
的 ; 有 力 的 ; 有 影响 的 
waveform n. (电子 ) 波 形 
electromagnetic adj. 电磁 的 
technophobic n. XU ERG. RAGE 





患者 ; 厌恶 计算 机 者 n. 合计 ; 集合 体 ; 总 计 
miniaturize vt. 使 小 型 化 ; 使 微型 化 adj. 聚合 的 ; 集合 的 ; 合计 的 


Exercises 


I. Translate the following phrases into Chine: 


1. portable device GN 
x ee 


5 

6 

7. ma co ication system 

8 X 
9 

1 


2. speech waveform 
3. image waveform 
4. text file 

binary sequence 


electromagneji ion 


; Me A 
. channel Separation theorem 


0. channel decoding 


I. Fill in the blanks with missing words from the table below. 




















capacity successive channel sensitive 
compression restricted distortion input 
essential modulation arbitrarily expense 
waveform inner discrete layers 
modulator analog probabilistic unintuitive 














l. 


Frequently the error probability incurred with simple 


techniques is too high. One possible solution is to separate the 





: first an error-correcting code, then a simple 


and demodulation 


encoder into two 








2. What Shannon showed was the very fact that more sophisticated coding 


schemes can achieve low error probability at any data rate below a value known as 
the channel . The channel capacity is a function of the description of the 
output conditional on each possible . Conversely. it is not possible to achieve low 


error probability at rates above the channel capacity. 
3. The output of an source, in the simplest case. is an analog real 
» representing, for example, a speech waveform. The word analog is used to em- 
phasize that the waveform can be arbitrary and is not to taking on amplitudes 
Írom some set of values. 


uences and fully ana- 


4. Discrete source coding is important both for discrete sources. such as text and com- 
puter files, and also as an layer for discrete-time a das 


log sources. It is to focus on the range of possib uts from the source rather 


than any one particular output. It is also important me n probabilistic models so as to 
wi 


achieve the best for the most commo; 
puts. 


5. Variable-length source coding is, KK lest way to provide good compression for 


th less care for very rare out- 






common source outputs at the rare outputs, The necessity to concatenate 





variable-length “St O. s to the A tic concept of unique decod- 


ability. 

6. There are some v: ZA to measurii "n in a mean-squared sense. For 
example. efficient s ers are based on od human speech. They make use of the fact 
that human li NS more — Mua. of reconstruction error than others, so as, 
for example, A it larger errors when the fiia is loud than when it is soft. 


II. Translate the following sentences into Chinese. 







JF 


l. The first is to understand analog data compression. i.e.. the compression of 
sources such as voice for which the output is an arbitrarily varying real- or complex-valued 
function of time; we denote such functions as waveforms. The second is to begin studying 
the waveforms that are typically transmitted at the input and received at the output of com- 


munication channels. The same set of mathematical tools is required for the understanding 





and representation of both source and channel waveforms. 
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2. A vector space is essentially a collection of objects (such as the collection of real n- 
tuples) along with a set of rules for manipulating those objects. There is a set of axioms 
describing precisely how these objects and rules work. Any properties that follow from 


those axioms must then apply to any vector space. i. e. . any set of objects satisfying those 


axioms. 

















3. Digital modulation (or channel encoding) is the proc nverting an input se- 






quence of bits into a waveform suitable for transmission o mmunication channel. De- 


t the receiver of converting 


modulation (channel decoding) is the MEE. 
the received waveform into a ( perhaps noisy) AR e input bit sequence. 




















gh Da ing，and Decoding 


This paper uses that characterization to retrieve the signal from the noise-corrupted re- 
ceived waveform. As one might guess. this is not possible without occasional errors when 
the noise is unusually large. The objective is to retrieve the data while minimizing the 
effect of these errors. This process of retrieving data from a noise-corrupted version is 
known as detection. 

Detection. decision making. hypothesis testing. and decoding are synonyms. The 
word detection refers to the effort to detect whether some phenomenon is present or not on 
the basis of observations. For example. a radar system uses observations to detect whether 
or not a target is present; a quality control system attempts to detect whether a unit is de- 
fective; a medical test detects whether a given disease is present. The meaning of detection 


has been extended in the digital communication field from a yes/no decision to a decision at 





the receiver between a finite set of possible transmitted signals. Such a decision between a 


set of possible transmitted signals is also called decoding. but here the possible set is usual- 


UNIT 15 DIGITAL COMMUNICATION 








ly regarded as the set of codewords in a code rather than the set of signals in a signal set. 
Decision making is. again. the process of deciding between a number of mutually exclusive 
alternatives. Hypothesis testing is the same, but here the mutually exclusive alternatives 
are called hypotheses. We use the word hypotheses for the possible choices in what fol- 
lows. since the word conjures up the appropriate intuitive image of making a choice be- 
tween a set of alternatives. where only one alternative is correct and there is a possibility of 
erroneous choice. 

These problems will be studied initially in a purely probabilistic setting. That is. there 


is a probability model within which each hypothesis is an event. These events are mutually 





exclusive and collectively exhaustive; i.e. . the sample outcome of the experiment lies in 
one and only one of these events. which means that in each p: ce of the experiment, 
one and only one hypothesis is correct, Assume there a potheses, labeled a, ,…， 
ay. The sample outcome of the experiment will xp ese M events, and this defines 
a random symbol U which, for each m. takes th se^ event occurs. The margin- 
al probability py Can) of hypothesis aw is X ,, and is called the a-priori probability 
of a,. There is also a random variable ( ^M the observation. A sample value v of 
V is observed. and on the basis of Wa the detector selects one of the possible 
M hypotheses. The ii^ qually well aie «sits x random variable, a random 
vector, a random unir zi iom symbol; A Po are discussed in what 
follows. 


Before discu 





d. to make decisions, a to understand when and why 
decisions mu: NE For a binary e + assume that the conditional probability of 
hypothesis oy the observation is 2/3 and that of hypothesis a; is 1/3. Simply deciding 
on hypothesis a) and forgetting about the probabilities throws away the information about 
the probability that the decision is correct. However. actual decisions sometimes must be 
made. In a communication system. the user usually wants to receive the message (even 
partly garbled) rather than a set of probabilities. In a control system. the controls must 
occasionally take action. Similarly. managers must occasionally choose between courses of 
action. between products. and between people to hire. In a sense, it is by making decisions 
that we return from the world of mathematical probability models to the world being mod- 
eled. 

There are a number of possible criteria to use in making decisions. Initially assume 
that the criterion is to maximize the probability of correct choice. That is. when the experi- 
ment is performed. the resulting experimental outcome maps into both a sample valuea,, for 
U and a sample value v for V. The decision maker observes v (but not a,,) and maps v into 


a decision u(v) . The decision is correct if 4(v) = a, . In principle. maximizing the proba- 
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bility of correct choice is almost trivially simple. Given v. calculate pulv(a,, | v) for each 

possible hypothesis a„. This is the probability that a,, is the correct hypothesis conditional on v. 

Thus the rule for maximizing the probability of being correct is to choose t (v) to be that a,, for 
which pulv Can | v) is maximized. For each possible observation v. this is denoted by 

u(v) = arg maxpulv (an |v) (MAP rule) (15-1) 

where arg max means the argument m that maximizes the function. If the maximum is 

not unique, the probability of being correct is the same no matter which maximizing m is 

chosen, so, to be explicit, the smallest such m will be chosen, Since the rule (15-1) ap- 


plies to each possible sample output v of the random variable V, (15-1) also defines the se- 


is as a random symbol U(V) . The conditional AM pulvis called an a- 


posteriori probability. This is in contrast to the a-priori pr 


fore the observation of V. The decision rule in (15-1) à 
riori probability (MAP) rule. 


AP rule depends only on the condi- 


lected hypothes 











pu of the hypothesis be- 


s called the maximum a-poste- 


An important consequence of (15-1) is 






tional probability pulv and thus is compl rmined by the joint distribution of U and 


V. Everything else in the probability is irrelevant to making a MAP decision. 
When distinguishing between di nt decision rulessthe MAP decision rule in (15-1) 
will be denoted by dap Co) 一 3v e MAP rule ma em probability of correct deci- 
sion for each sample value v.^it^ilso maximizes bility of correct decision averaged 
over all v, To see reca, let "X ms decision rule, Since ŭmap maxi- 
mizes pulv Ca, S m. 
bul ay ) | 9 — pu (nw) | 3 > 0 for each rule D and observation v 
(15-2) 
Taking the expected value of the first term on the left over the observationV. we get 
the probability of correct decision using the MAP decision rule. The expected value of the 
second term on the left for any given I is the probability of correct decision using that rule. 
Thus. taking the expected value of (15-2) over V shows that the MAP rule maximizes the 
probability of correct decision over the observation space. The above results are very sim- 
ple. but also important and fundamental. They are summarized in the following theorem. 
Before discussing the implications and use of the MAP rule. the above assumptions are re- 
viewed. First. a probability model was assumed in which all probabilities are known. and 
in which, for each performance of the experiment. one and only one hypothesis is correct. 
This conforms very well to the communication model in which a transmitter sends one of a 
set of possible signals and the receiver. given signal plus noise. makes a decision on the 


signal actually sent. It does not always conform well to a scientific experiment attempting 











to verify the existence of some new phenomenon; in such situations. there is often no sen- 
sible way to model a-priori probabilities. Detection in the absence of known a-priori proba- 
bilities is discussed in later parts. 

The next assumption was that maximizing the probability of correct decision is an ap- 
propriate decision criterion. In many situations. the cost of a wrong decision is highly 
asymmetric. For example. when testing for a treatable but deadly disease. making an error 
when the disease is present is far more costly than making an error when the disease is not 
present. It is easy to extend the theory to account for relative costs of errors. With the 
present assumptions. the detection problem can be stated concisely in the following proba- 
bilistic terms. There is an underlying sample space. a probability measure. and two rvs U 


and V of interest. The corresponding experiment is performeg. server sees the sample 







value v of rv V, but does not observe anything else. parti not the sample value of U, 


say am. The observer uses a detection rule. uv) . m unction mapping each possi- 
ble value of v to a possible value of U. If u (v) thé/detection is correct; otherwise an 


error has been made. The above MAP RS 





es the probability of correct detection 
conditional on each v and also maximize; Se sion probability of correct detection, 


Obviously. the observer must SD ditional probability assignment pulv in order to 


use the MAP rule. X 


New Words and Ph 


m V 
hypothesis 7. c es 7。 标 准 


conjure vt. 想象 conform vi. 符合 ; 遵照 ;! 适应 环境 
intuitive adj. 直觉 的 ; 赁 直觉 获知 的 vt， 使 遵守 ; 使 一 致 ， 使 顺从 
erroneous adj. 错误 的 ; 不 正确 的 asymmetric adj. 不 对 称 的 ; 非 对 称 的 
probabilistic adj. 概率 的 a-posteriori adj. 后 验 的 


garble vt. 断章取义 ; Elh; RIS 
n 断章取义 ; 混淆; 算 改 


Exercises 


I. Answer the following questions. 


1. What does "detection" refer to in this paper? 
2. What is the MAP rule? 


3. Which decision criterion is appropriate to decoding? 








I. Translate the following sentences into Chinese. 


1. Detection. decision making. hypothesis testing. and decoding are synonyms. The 
word detection refers to the effort to detect whether some phenomenon is present or not on 
the basis of observations. For example. a radar system uses observations to detect whether 
or not a target is present; a quality control system attempts to detect whether a unit is de- 


fective; a medical test detects whether a given disease is present. 




















2. Asone might guess. this is not AE ccasional errors when the noise is 


unusually large. The objective is to retrieve while minimizing the effect of these 


errors. This process of retrieving data K oise-corrupted version is known as detec- 


yy 


3. The meaning of detection has been extended in the digital communication field from 


tion, 


























a yes/no decision to a decision at the receiver between a finite set of possible transmitted 





signals. Such a decision between a set of possible transmitted signals is also called deco- 
ding, but here the possible set is usually regarded as the set of codewords in a code rather 


than the set of signals in a signal set. 





























Abstract Reading 


Spatially-coupled LDPC Codes for Decode-and-forward Relaying of Two Correlated 
Sources over the BEC 


We present a decode-and-forward transmission scheme based on spatially-coupled low- 


density parity-check (SC-LDPC) codes for a network consisting of two (possibly correla- 


ted) sources. one relay. and one destination. The links bet nodes are modeled as 
binary erasure channels. Joint source-channel coding with hannel decoding is used to 
exploit the correlation. The relay performs network ng) We derive analytical bounds 






on the achievable rates for the binary erasure iVision multiple-access relay channel 


with correlated sources. We then design bila SG+LDPC codes and analyze their asymp- 

totic performance for this scenario. We atialytically that the proposed coding scheme 

achieves the theoretical limit for sy: © channel conditions and uncorrelated sources. 

Using density evolution, we furt nore demonstrate viris scheme approaches the the- 
a Wi 


oretical limit also for non-syarpetric channel condi ies: 

ted. and we observe G saturation e is is typical for spatially-coupled sys- 
tems. Finally. "ses nulation results SN o lengths. which validate the DE a- 
nalysis. Xo j Re 


A Tractable Approach to Coverage and Rate in Cellular Networks 


hen the sources are correla- 


Cellular networks are usually modeled by placing the base stations on a grid. with mo- 
bile users either randomly scattered or placed deterministically. These models have been 
used extensively but suffer from being both highly idealized and not very tractable. so com- 
plex system-level simulations are used to evaluate coverage/outage probability and rate. 
More tractable models have long been desirable. We develop new general models for the 
multi-cell signal-to-interference-plus-noise ratio (SINR) using stochastic geometry. Under 
very general assumptions, the resulting expressions for the downlink SINR CCDF (equiva- 
lent to the coverage probability) involve quickly computable integrals, and in some practi- 
cal special cases can be simplified to common integrals (e. g. . the Q-function) or even to 
simple closed-form expressions. We also derive the mean rate. and then the coverage gain 


Cand mean rate loss) from static frequency reuse. We compare our coverage predictions to 








the grid model and an actual base station deployment. and observe that the proposed model 
is pessimistic (a lower bound on coverage) whereas the grid model is optimistic. and that 
both are about equally accurate. In addition to being more tractable. the proposed model 
may better capture the increasingly opportunistic and dense placement of base stations in 


future networks. 


A Technique for Orthogonal Frequency Division Multiplexing Frequency 
Offset Correction 


This paper discusses the effects of frequency offset on the performance of orthogonal 
frequency division multiplexing (OFDM) digital communicatjons e main problem with 
Írequency offset is that it introduces interference among S ltiplicity of carriers in the 
OFDM signal. It is shown, and confirmed by XL at to maintain signal-to-inter- 
ference ratios of 20 dB or greater for the OFD! ieFS/ offset is limited to 4% or less of 
the intercarrier spacing. Next, the paper RON technique to estimate frequency offset 
using a repeated data symbol. A maximüm ihood estimation (MLE) algorithm is de- 


rived and its performance computed a pared with simulation results. Since the inter- 
NÉ both F^. eid to the estimate, 

accurate estima ef when the offset is far too great 
to demodulate the cos Mies Also, the estimati WE depends only on total symbol en- 
ergy so it is innate to channel spreadi NA selective fading. A strategy is 


described ford uisition in the evil fruncertainty in the initial offset that exceeds 
1/2 the carrier Spacing. the limit of the MLE algorithm. 


carrier interference energy ani 
Í ~ 
the algorithm generates ext 
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DIGITAL SIGNAL PHOCESSING 


Text: The Breadth and De SPI 


Digital Signal Processing is one of the mos ful technologies that will shape sci- 






ence and engineering in the twenty-first evolutionary changes have already been 


made in a broad range of fields: comi ions, medical imaging. radar &- sonar, high fi- 
delity music reproduction. and oi ceting, to name j few. Each of these areas has 


developed a deep DSP tech ith its own NL mathematics. and specialized 





techniques, This ACE. of breadth and Msc s it impossible for any one individ- 





ual to master all ofthe DSP ; um tha en developed DSP education involves 





two tasks: EN ning"guneral concepts tha to the field as a whole. and learning spe- 
cialized techni iq for your particular area bf interest. This chapter starts our journey into 
the world of Digital Signal Processing by describing the dramatic effect that DSP has made 


in several diverse fields. The revolution has begun. 
The Roots of DSP 


Digital Signal Processing is distinguished from other areas in computer science by the 
unique type of data it uses: signals. In most cases. these signals originate as sensory data 
from the real world: seismic vibrations, visual images. sound waves. etc.. DSP is the 
mathematics. the algorithms. and the techniques used to manipulate these signals after 
they have been converted into a digital form. This includes a wide variety of goals. such 
as: enhancement of visual images, recognition and generation of speech, compression of 
data for storage and transmission. etc.. Suppose we attach an analog-to-digital converter 


to a computer and use it to acquire a chunk of real world data. DSP answers the question: 
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What next? 

The roots of DSP are in the 1960s and 1970s when digital computers first became a- 
vailable. Computers were expensive during this era. and DSP was limited to only a few 
critical applications, Pioneering efforts were made in four key areas; radar & sonar, where 
national security was at risk; oil exploration, where large amounts of money could be 
made; space exploration. where the data are irreplaceable; and medical imaging. where 
lives could be saved. The personal computer revolution of the 1980s and 1990s caused DSP 
to explode with new applications. Rather than being motivated by military and government 
needs, DSP was suddenly driven by the commercial marketplace. Anyone who thought 
they could make money in the rapidly expanding field was suddenly a DSP vendor. DSP 


reached the public in such products as; mobile telephones. c isc players. and elec- 
tronic voice mail. NN 


This technological revolution occurred from aa In the early 1980s, DSP 
was taught as agraduate level course in electrica ine&ing. A decade later, DSP had be- 
come a standard part of the a eee . Today. DSP is a basic skill needed 


analogy. DSP can be compared to a previ- 







by scientists and engineers in many fiel 
ous technological revolution; electron! hile still the realm of electrical engineering. 
nearly every scientist and engineekH&s some ei circuit design. Without it, 
they would be lost in the vago world. DS! Msi Same future. 

This recent a ll than a curiosity; E a tremendous impact onyour ability 
to learn and use TES ppose you eoe Nr problem. and turn to textbooks or 
other publicatis td find a solution, W will typically find is page after page of e- 
quations. obsc mathematical symbols. and unfamiliar terminology. It’s a nightmare! 
Much of the DSP literature is baffling even to those experienced in the field. It's not that 
there is anything wrong with this material, it is just intended for a very specialized audi- 
ence. State-of-the-art researchers need this kind of detailed mathematics to understand the 
theoretical implications of the work. 

A basic premise of this book is that most practical DSP techniques can be learned and 
used without the traditional barriers of detailed mathematics and theory. The Scientist and 
Engineer's Guide to Digital Signal Processing is written for those who want to use DSP as a 
tool. not a new career. 

The remainder of this paper illustrates areas where DSP has produced revolutionary 
changes. As you go through each application. notice that DSP is very interdisciplinary. re- 
lying on the technical work in many adjacent fields. The borders between DSP and other 
technical disciplines are not sharp and well defined. but rather fuzzy and overlapping. If 


you want to specialize in DSP. these are the allied areas you will also need to study. 
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Telecommunications 


Telecommunications is about transferring information from one location to another. 
This includes many forms of information: telephone conversations. television signals, 
computer files. and other types of data. To transfer the information. you need achannel 
between the two locations. This may be a wire pair. radio signal. optical fiber. etc.. Tele- 
communications companies receive payment for transferring their customer's information. 
while they must pay to establish and maintain the channel. The financial bottom line is 
simple: the more information they can pass through a single channel. the more money they 
make. DSP has revolutionized the telecommunications indus any areas; signaling 


tone generation and detection. frequency band shifting. ing to remove power line 


hum, etc.. Three specific examples from the wigs work will be discussed here: 


multiplexing, compression. and echo control. s$ 


Multiplexing 


There are approximately AiD telephones A At the press of a few 
buttons, switching eme any one of thes set nnected to any other in only a 
few seconds. The i this task is mi Nt 


between two tele s required passing t voice signals through mechanical swit- 
— 

ches and am NOH ne connection its pair of wires. In comparison, DSP con- 

verts audio sigmal$ into a stream of serial digital data. Since bits can be easily intertwined 


and later separated. many telephone conversations can be transmitted on a single channel. 


ing! Until the 1960s. a connection 


For example. a telephone standard known as the T-carrier system can simultaneously 
transmit 24 voice signals. Each voice signal is sampled 8 000 times per second using an 8 bit 
companded (logarithmic compressed) analog-to-digital conversion. This results in each voice sig- 
nal being represented as 64 000 bps. and all 24 channels being contained in 1. 544 Mbps. This 
signal can be transmitted about 6 000 feet using ordinary telephone lines of 22 gauge copper wire, 
a typical interconnection distance. The financial advantage of digital transmission is enormous. 


Wire and analog switches are expensive; digital logic gates are cheap. 
Compression 


When a voice signal is digitized at 8 000 samples/sec. most of the digital information is 


redundant. That is. the information carried by any one sample is largely duplicated by the 
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neighboring samples. Dozens of DSP algorithms have been developed to convert digitized 
voice signals into data streams that require fewer bps. These are called data compression 
algorithms. Matching uncompression algorithms are used to restore the signal to its origi- 
nal form. These algorithms vary in the amount of compression achieved and the resulting 
sound quality. In general, reducing the data rate from 64 kbps to 32 kbps results in no loss 
of sound quality. When compressed to a data rate of 8 kbps. the sound is noticeably affect- 
ed. but still usable for long distance telephone networks. The highest achievable compres- 
sion is about 2 kbps. resulting in sound that is highly distorted. but usable for some appli- 


cations such as military and undersea communications. 


Echo Control qe 


Echoes are a serious problem in long distance ade mnections. When you speak 
into a telephone, a signal representing your voic: veto the connecting receiver. where 
a portion of it returns as an echo. If the conheet s within a few hundred miles, the e- 
lapsed time for receiving the echo is onl K illiseconds. The human ear is accustomed 


to hearing echoes with these small ti ays, and the connection sounds quite normal. 
As the distance becomes larg AA o becomes Word noticeable and irritating. 
The delay can be several apa milliseconds for j of inental communications, and is 
particularly BUG m ga 

Digital SignalP ces ing attacks this Minen by measuring the returned signal 
and generating canes d el the offending echo. This same technique 
allows s. users to hear and speak at the same time without fighting audio feed- 
back (squealing). It can also be used to reduce environmental noise by canceling it with 


digitally generated antinoise. 


New Words and Phrases 


sonar n. JEM; 声波 定位 仪 使 难 理解 

sensory adj. 感觉 的 ; 知觉 的 ; 传递 感觉 的 ”baffle vt. ee 困惑 ; 使 …… 受挫 折 ; 用 
seismic adj. 地 震 的 挡 板 控制 

vibration n. 振动 interdisciplinary adj. 各 学 科 间 的 

chunk n. Ktk fuzzy adj. 模糊 的 ; 失真 的 

obscure adj. PERAJ. MEAD; BEEEHJ. A overlap vt. jese MB; jere 同时 发 生 





清楚 的 ; 隐蔽 的 ; 不 著名 的 无 名 的 filter n. 滤波 器 
wi Ee 模糊 不 清 ， 掩 盖 ; 隐藏 multiplexing n. 9852 Hi 























echo n， 回 音 ; 效仿 
immensity n. EK; 无 限 
boggle vi. ff. 退缩 ; Don 
vi. HR. FE 坏 ; 使 …… 
使 TM 困惑 

HR. iss 惊奇 
intertwine vt. 缠绕; 纠缠 
compand n. EAI JE 





对 数 的 


logarithmic adj. 


Exercises 


I. Translate the following phrases into Chii 


"s 


Digital Signal Processing 
medical imaging 

. seismic vibration 

circuit design 

financial bottom li zx 


echo control 


analog wig 
eal PS NP 
data stf£e 


10. elapsed time 


9e xn m 


lgnals 
al conversion 
9. 








megabits 2。 兆 位 
gauge n. 计量 器 ;标准 尺寸 ; 容量 规格 
vt. Wi. 估计 ; 给 …… 定 规格 


复制 ; 使 加 倍 
副本 ; 复制 品 
扭曲 ;使 失真 ;曲解 
消逝 ;时 间 过 去 ; 

n. 流逝; 时间 的 过 去 


K 
a 


duplicate vt. 
n. 
distort vt. 


elapse vi. 


JU 


Il. Fill in the blanks with missing words from the table below. 

















application filters reactions continuously 

functionality systematic linear fundamentally 
diagnostic sensors classification implement 
analog hypotheses compliance capabilities 
embedded sequence processing components 














l. The primary traits of 


from general purpose computer systems are their predictable 


from the environment. their form-and cost-optimized design. and their 


signal processing systems that distinguish them 
to real-time stimuli 


with re- 





信息 与 通信 专业 英语 








quired or specified modes of response behavior and 
2. An embedded system usually consists of hardware such as memories. 
-specific ICs (ASICs). processors. DSPs. buses. analog-digital interfaces. and 
also software components that provide control. . and application-specific 


required of it. In addition, they often contain electromechanical (EM) compo- 


nents such as and transducers and operate in harsh environmental conditions. 

3. Digital signal processing methods require that signals are quantized at 
discrete time instances and represented as a of words consisting of 1 s and 0's. In 
nature. signals are usually nonquantized and varied with time. Natural signals 


such as air pressure waves as a result of speech are converted by a-transducer to a propor- 





tional electrical signal. 

4. Digital are widely used in processing digi nals of many diverse appli- 
cations, including speech processing and data comm ions, image and video , 
sonar, radar. seismic and oil exploration. and electronics. One class of digital 
filters, the shift-invariant (LSD iy he most frequently used because they 
are simple to analyze. design. and ey 


5. Detection and classification ari Signal processing problems whenever a decision 





is to be made among a finite o conce: 


Da observed waveform. Signal 
her the ERE 5 of "noise alone" or "signal 







detection algorithms decide 
masked by noise”. Signakclassification algorithm, le whether a detected signal belongs 
to one or ie ca ecified classes "^e The objective of signal detection and 

MSS é-Specify diss for designing algorithms which minimize 


the average number of decision errors. 


II. Translate the following sentences into Chinese. 


l. Images are signals with special characteristics. First. they are a measure of a pa- 
rameter over space (distance), while most signals are a measure of a parameter over time. 
Second. they contain a great deal of information. For example. more than 10 megabytes 
can be required to store one second of television video. This is more than a thousand times 
greater than for a similar length voice signal. Third. the final judge of quality is often a 
subjective human evaluation. rather than an objective criterion. These special characteris- 


tics have made image processing a distinct subgroup within DSP. 
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2. Sometimes. you just have to make the most out of a bad picture. This is frequently 
the case with images taken from unmanned satellites and space exploration vehicles. No 
one is going to send a repairman to Mars just to tweak the knobs on a camera! DSP can im- 
prove the quality of images taken under extremely unfavorable conditions in several ways: 
brightness and contrast adjustment, edge detection, noise reduction, focus adjustment, 
motion blur reduction, etc.. Images that have spatial distortion, such as encountered when 


allowing the informa- 


a flat image is taken of a spherical planet, can also bewarped into a correct representation. 
Many individual images can also be combined into a single d Xs 


tion to be displayed in unique ways. For example. a AS quence simulating an aerial 


flight over the surface of a distant planet. 


























Ww 


odi e 
Re ing: The Breadth and Depth of DSP II 


Audio Processing 


The two principal human senses are vision and hearing. Correspondingly, much of 
DSP is related to image and audio processing. People listen to both music and speech. DSP 


has made revolutionary changes in both these areas. 

Music 

The path leading from the musician's microphone to the audiophile's speaker is re- 
markably long. Digital data representation is important to prevent the degradation com- 


monly associated with analog storage and manipulation. This is very familiar to anyone 


who has compared the musical quality of cassette tapes with compact disks. In a typical 
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scenario. a musical piece is recorded in a sound studio on multiple channels or tracks. In 
some cases. this even involves recording individual instruments and singers separately. 
This is done to give the sound engineer greater flexibility in creating the final product. The 
complex process of combining the individual tracks into a final product is called mix down. 
DSP can provide several important functions during mix down, including: filtering, signal 
addition and subtraction, signal editing. etc. . 

One of the most interesting DSP applications in music preparation is artificial reverber- 
ation. If the individual channels are simply added together. the resulting piece sounds frail 
and diluted. much as if the musicians were playing outdoors. This is because listeners are 
greatly influenced by the echo or reverberation content of the musig, which is usually mini- 
mized in the sound studio. DSP allows artificial echoes and ry tion to be added dur- 
ing mix down to simulate various ideal listening environm choes with delays of a few 


hundred milliseconds give the impression of us ations. Adding echoes with 
e 


delays of 10-20 milliseconds provide the BANS 

Speech Generation RY 
Speech generation and recogniti e to communicate between humans and ma- 
NN Yon and ears, This is very 


modest size listening rooms. 





chines. Rather than using your hi nd eyes. you usi 
convenient when your hand® eyes should be something else. such as; driving a 
car. performing surger r (unfortunately) fixi ur weapons at the enemy. Two ap- 


proaches are used onjputer generated see igital recording and vocal tract simula- 
一 





tion. In digitawecording, the voice of a peaker is digitized and stored. usually in a 
compressed PN During playback, the stored data are uncompressed and converted back 
into an analog signal. An entire hour of recorded speech requires only about three mega- 
bytes of storage, well within the capabilities of even small computer systems. This is the 
most common method of digital speech generation used today. 

Vocal tract simulators are more complicated, trying to mimic the physical mechanisms 
by which humans create speech. The human vocal tract is an acoustic cavity with resonant 
frequencies determined by the size and shape of the chambers. Sound originates in the vocal 
tract in one of two basic ways, called voiced and fricative sounds. With voiced sounds, vo- 
cal cord vibration produces near periodic pulses of air into the vocal cavities. In compari- 
son, fricative sounds originate from the noisy air turbulence at narrow constrictions, such 
as the teeth and lips. Vocal tract simulators operate by generating digital signals that re- 
semble these two types of excitation. The characteristics of the resonate chamber are simu- 
lated by passing the excitation signal through a digital filter with similar resonances. This 


approach was used in one of the very early DSP success stories. the Speak & Spell. a wide- 
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ly sold electronic learning aid for children. 





Speech Recognition 


The automated recognition of human speech is immensely more difficult than speech 
generation. Speech recognition is a classic example of things that the human brain does 
well. but digital computers do poorly. Digital computers can store and recall vast amounts 
of data, perform mathematical calculations at blazing speeds. and do repetitive tasks with- 
out becoming bored or inefficient. Unfortunately. present day computers perform very 
poorly when faced with raw sensory data. Teaching a computer to send you a monthly elec- 
tric bill is easy. Teaching the same computer to understand your veice is a major undertak- 
ing. 

Digital Signal Processing generally approaches the pr of voice recognition in two 
steps: feature extraction followed by feature matching Each word in the incoming audio 
signal is isolated and then analyzed to identify t excitation and resonate frequen- 
cies. These parameters are then compared PA ibus examples of spoken words to iden- 


tify the closest match, Often. these sys limited to only a few hundred words; can 






only accept speech with distinct pauses een words; and must be retrained for each in- 


dividual speaker. While this is a € for many Mus applications. these limita- 
tions are humbling when coi d to the me oo hearing. There is a great deal 
ci 


of work to be done in aw with tremend 
successful comme products. x NY 

/ Re 
Echo Location 


A common method of obtaining information about a remote object is to bounce a wave 





ial rewards for those that produce 


off of it. For example. radar operates by transmitting pulses of radio waves. and exami- 
ning the received signal for echoes from aircraft. In sonar. sound waves are transmitted 
through the water to detect submarines and other submerged objects. Geophysicists have 
long probed the earth by setting off explosions and listening for the echoes from deeply 
buried layers of rock, While these applications have a common thread. each has its own 
specific problems and needs. Digital Signal Processing has produced revolutionary changes 


in all three areas. 
Radar 


Radar is an acronym for Radio Detection And Ranging. In the simplest radar system. 


a radio transmitter produces a pulse of radio frequency energy a few microseconds long. 
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This pulse is fed into a highly directional antenna. where the resulting radio wave propa- 
gates away at the speed of light. Aircraft in the path of this wave will reflect a small por- 
tion of the energy back toward a receiving antenna. situated near the transmission site. 
The distance to the object is calculated from the elapsed time between the transmitted pulse 
and the received echo. The direction to the object is found more simply; you know where 
you pointed the directional antenna when the echo was received. 

The operating range of a radar system is determined by two parameters: how much en- 
ergy is in the initial pulse. and the noise level of the radio receiver. Unfortunately. increas- 
ing the energy in the pulse usually requires making the pulse longer. In turn. the longer 


conflict between two important parameters; the ability to de ts at long range. and 


pulse reduces the accuracy and precision of the elapsed time jore: This results in a 
the ability to accurately determine an object's distance. 

DSP has revolutionized radar in three areas. all hich relate to this basic problem. 
First. DSP can compress the pulse after it is receiced. viding better distance determina- 
tion without reducing the operating range. pos SP can filter the received signal to de- 
crease the noise. This increases the ra Out degrading the distance determination. 


Third, DSP enables the rapid selec and generation of different pulse shapes and 







lengths. Among other things, this ws the pulse to Stimized for a particular detec- 

tion problem. Now the imp vide part; much of thj at a sampling rate comparable 

to the radio frequency are high as severakl megahertz! When it comes to ra- 
bout hi 


dar. DSP is as "S igh-speed hors Sign as it is about algorithms. 
| Ke 


New sds Ph 


degradation n， 退 化; 降格， 降级 ; MUTA cavity n. W; i. MAb 
reverberation n. JQ Me; [m mj; 反射 反 resonant adj. 共振 的 ; 共鸣 的 





射 物 periodic adj. 周期 的 ; 定期 的 
frail adj. 脆弱 的 ;虚弱 的 excitation n. 激发， 刺激 ; 激励 
dilute adj. 稀释 的 ; 淡 的 resonate vt. 共振 

vi. FORE; 冲淡 ; 削弱 resonance n. 共振 

simulate vt. 仿真 blazing adj. 燃烧 的 ; 强烈 的 ; DER 
cathedral n. 大 教堂 extraction n. 取出; 抽出 ; Rih; 抽出 物 
megabyte n. JEF humble adj. 谦逊 的 ; 简陋 的 ; (级别 或 地 
vocal adj. 歌唱 的 ; 声音 的 ， 有 声 的 位 ) 低 下 的 ; 不 大 的 

n， 声 乐 作品 ; 元 音 bounce n. 跳 ; 弹力 ; 活力 


acoustic adj. 声学 的 ; 音响 的 ; 听觉 的 vi. 弹跳 ; 使 弹 起 
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acronym 7。 首 字母 缩 略 词 antenna n. 天线 

pulse n. 脉冲 ; 脉搏 propagate vt. 传播 ;传送 
vt， 使 跳动 megahertz n. 兆赫 

Exercises 


I. Answer the following questions. 





rally approaches the 


1. What are the two approaches used for computer generated speech? 
2. What are the two steps that Digital Signal Process dos 


problem of voice recognition? 


3. What's the common method of obtaining oo bout a remote object? 


I. Translate the following sentences into 


~ 


1. The path leading {rom the musi crophone to the audiophile's speaker is re- 






markably long. Digital data represi is important to prevent the degradation com- 


monly associated with analog,st: eand HE o is very familiar to anyone 


who has compared the mu ai aly of cassette wes 
Sia 


2. The automated recognition of human speech is immensely more difficult than 


compact disks. 

















speech generation. Speech recognition is a classic example of things that the human brain 
does well. but digital computers do poorly. Digital computers can store and recall vast a- 
mounts of data. perform mathematical calculations at blazing speeds. and do repetitive 


tasks without becoming bored or inefficient. 

















3. A common method of obtaining information about a remote object is to bounce a 


wave off of it. For example. radar operates by transmitting pulses of radio waves. and ex- 
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amining the received signal for echoes from aircraft. In sonar. sound waves are transmitted 


through the water to detect submarines and other submerged objects. 

















Abstract Reading 


Source Separation Using Regularized NMF e 


Priors with Online Learning for the Uncertai 





imates under GMM 


We propose a new method to incor] 
factorization (NMF). The NMF so 


~ 
aN iors on the solution of nonnegative matrix 






3 guided to follow the minimum mean square er- 
ror (MMSE) estimates of the t combinations Xr a Gaussian mixture model 
(GMM) prior. The "— orithm can be use “an ising or single-channel source 
separation (SCSS) Vi em NMF is usedwir Eos in two main stages. the training 
stage and the as stage. In the —: NMF is used to decompose the train- 
ing data spec ranj for each source into iplication of a trained basis and gains matri- 
ces, In the PNG stage. the mixed signal spectrogram is decomposed as a weighted lin- 
ear combination of the trained basis matrices for the source signals. In this work. to im- 
prove the separation performance of NMF. the trained gains matrices are used to guide the 
solution of the NMF weights during the separation stage. The trained gains matrix is used 
to train a prior GMM that captures the statistics of the valid weight combinations that the 
columns of the basis matrix can receive for a given source signal. In the separation stage. 
the prior GMMs are used to guide the NMF solution of the gains/weights matrices using 
MMSE estimation. The NMF decomposition weights matrix is treated as a distorted image 
by a distortion operator. which is learned directly from the observed signals. The MMSE 
estimate of the weights matrix under the trained GMM prior and log-normal distribution 
for the distortion is then found to improve the NMF decomposition results. The MMSE es- 
timate is embedded within the optimization objective to form a novel regularized NMF cost 
function. The corresponding update rules for the new objectives are derived in this paper. 


The proposed MMSE estimates based regularization avoids the problem of computing the 
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hyper-parameters and the regularization parameters. MMSE also provides a better estimate 





Íor the valid gains matrix. Experimental results show that the proposed regularized NMF 
algorithm improves the source separation performance compared with using NMF without a 


prior or with other prior models. 


Robust Feature Extraction Based on an Asymmetric Level-dependent Auditory Fil- 
terbank and a Subband Spectrum Enhancement Technique 


In this paper we introduce a robust feature extractor. dubbed as robust compressive 
gammachirp filterbank cepstral coefficients (RCGCC), based on an asymmetric and level- 


dependent compressive gammachirp filterbank and a sigmoid, s eighting rule for the 






enhancement of speech spectra in the auditory domain. T| of this work is to improve 


the robustness of speech recognition systems in additi Xs nd real-time reverberant en- 
vironments. As a post processing scheme we M time feature normalization 
technique called short-time cepstral mean rav ormalization (STCMSN), which, by 
adjusting the scale and mean of cepstral «lie SM the difference of cepstra between 
the training and test environments. Tlormance evaluation, in the context of speech 


recognition. of the proposed fgat&r&eXtractor we use gaged noisy AURORA-2 con- 
nected digit corpus, the meet recorder digits ( s subset of the AURORA-5 cor- 
pus. and the AUROR CSR corpus. whi WE additive noise. reverberant a- 
coustic conditions vad tive noise as wellzas different microphone channel conditions, re- 


spectively. T TSPadvanced front-en SI-AFE) the recently proposed power nor- 
tralvcóefficients (PNCC), conventional MFCC and PLP features are used for 






malized ceps 
comparison purposes. Experimental speech recognition results demonstrate that the pro- 
posed method is robust against both additive and reverberant environments, The proposed 
method provides comparable results to that of the ETSI-AFE and PNCC on the AURORA- 
2 as well as AURORA- corpora and provides considerable improvements with respect to 


the other feature extractors on the AURORA-5 corpus. 
Directional Acoustic Source Orientation Estimation Using Only Two Microphones 


A simple physical model consisting of a point source displaced from its center of rota- 
tion. in combination with a directivity model that includes backwards emitted energy. is 
considered for the problem of estimating the orientation of a directional acoustic source. 
Such a problem arises. for instance. in voice-commanded devices in a smart room and is u- 


sually tackled with a large or distributed microphone array. We show. however. that when 








the time difference of arrival is also taken into account. a small array of only two micro- 
phones is sufficiently robust against unaccounted factors such as microphone directivity 
variation and mild reverberation. This is shown by comparing predicted and measured val- 
ues of binaural cues. and by using them and pairwise frame energies as inputs for an artifi- 


cial neural network (ANN) in order to estimate source orientation. 








